Web: http://arxiv.org/abs/2205.02561

May 6, 2022, 1:11 a.m. | Mingyu Yang, Jian Zhao, Xunhan Hu, Wengang Zhou, Houqiang Li

cs.LG updates on arXiv.org arxiv.org

Cooperative multi-agent reinforcement learning (MARL) has made prominent
progress in recent years. For training efficiency and scalability, most of the
MARL algorithms make all agents share the same policy or value network.
However, many complex multi-agent tasks require agents with a variety of
specific abilities to handle different subtasks. Sharing parameters
indiscriminately may lead to similar behaviors across all agents, which will
limit the exploration efficiency and be detrimental to the final performance.
To balance the training complexity and the …

arxiv learning reinforcement reinforcement learning

More from arxiv.org / cs.LG updates on arXiv.org

Director, Applied Mathematics & Computational Research Division

@ Lawrence Berkeley National Lab | Berkeley, Ca

Business Data Analyst

@ MainStreet Family Care | Birmingham, AL

Assistant/Associate Professor of the Practice in Business Analytics

@ Georgetown University McDonough School of Business | Washington DC

Senior Data Science Writer

@ NannyML | Remote

Director of AI/ML Engineering

@ Armis Industries | Remote (US only), St. Louis, California

Digital Analytics Manager

@ Patagonia | Ventura, California