Feb. 6, 2024, 5:45 a.m. | Long Ma Yuanfei Wang Fangwei Zhong Song-Chun Zhu Yizhou Wang

cs.LG updates on arXiv.org arxiv.org

Fast adapting to unknown peers (partners or opponents) with different strategies is a key challenge in multi-agent games. To do so, it is crucial for the agent to efficiently probe and identify the peer's strategy, as this is the prerequisite for carrying out the best response in adaptation. However, it is difficult to explore the strategies of unknown peers, especially when the games are partially observable and have a long horizon. In this paper, we propose a peer identification reward, …

agent challenge context cs.ai cs.lg cs.ma exploration explore games identify key multi-agent partners peer probe strategies strategy

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne