Sept. 5, 2022, 1:13 a.m. | Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang

stat.ML updates on arXiv.org arxiv.org

Reinforcement learning algorithms require a large amount of samples; this
often limits their real-world applications on even simple tasks. Such a
challenge is more outstanding in multi-agent tasks, as each step of operation
is more costly requiring communications or shifting or resources. This work
aims to improve data efficiency of multi-agent control by model-based learning.
We consider networked systems where agents are cooperative and communicate only
locally with their neighbors, and propose the decentralized model-based policy
optimization framework (DMPO). In …

arxiv decentralized optimization policy scalable systems

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Analyst (CPS-GfK)

@ GfK | Bucharest

Consultant Data Analytics IT Digital Impulse - H/F

@ Talan | Paris, France

Data Analyst

@ Experian | Mumbai, India

Data Scientist

@ Novo Nordisk | Princeton, NJ, US

Data Architect IV

@ Millennium Corporation | United States