Scalable Model-based Policy Optimization for Decentralized Networked Systems. (arXiv:2207.06559v2 [cs.LG] UPDATED)

Sept. 5, 2022, 1:12 a.m. | Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang

cs.LG updates on arXiv.org arxiv.org

Reinforcement learning algorithms require large numbers of samples, which
often limits their real-world application even on simple tasks. The
challenge is more pronounced in multi-agent tasks, where each step of operation
is more costly because it requires communication or the shifting of resources.
This work aims to improve the data efficiency of multi-agent control through
model-based learning. We consider networked systems where agents are
cooperative and communicate only locally with their neighbors, and propose the
decentralized model-based policy optimization framework (DMPO). In …
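
The networked setting described above, in which each agent exchanges information only with its direct neighbors, can be illustrated with a small sketch. This is not the DMPO method itself. The ring topology, the scalar per-agent states, and the names `ring_neighbors` and `local_view` are all assumptions made for illustration.

```python
from typing import Dict, List


def ring_neighbors(num_agents: int) -> Dict[int, List[int]]:
    """Hypothetical topology: each agent is linked to the agents on either side of it on a ring."""
    return {
        i: sorted({(i - 1) % num_agents, (i + 1) % num_agents} - {i})
        for i in range(num_agents)
    }


def local_view(
    states: List[float], neighbors: Dict[int, List[int]], agent: int
) -> Dict[int, float]:
    """Return the states one agent can access: its own plus those of its direct neighbors."""
    visible = [agent] + neighbors[agent]
    return {j: states[j] for j in visible}


# Five agents with made-up scalar states; agent 0 sees only itself and agents 1 and 4.
neighbors = ring_neighbors(5)
states = [0.0, 1.0, 2.0, 3.0, 4.0]
view = local_view(states, neighbors, agent=0)
print(view)
```

Under this restriction, whatever an agent learns or decides depends only on `local_view`, so the amount of information per agent stays fixed as the number of agents grows.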
