Jan. 21, 2022, 2:11 a.m. | Prashant Trivedi, Nandyala Hemachandra

cs.LG updates on arXiv.org arxiv.org

Multi-agent actor-critic algorithms are an important part of the
Reinforcement Learning paradigm. We propose three fully decentralized
multi-agent natural actor-critic (MAN) algorithms in this work. The objective
is to collectively find a joint policy that maximizes the average long-term
return of these agents. In the absence of a central controller and to preserve
privacy, agents communicate some information to their neighbors via a
time-varying communication network. We prove convergence of all the 3 MAN
algorithms to a globally asymptotically stable …

algorithms arxiv learning natural reinforcement learning

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Analyst

@ SEAKR Engineering | Englewood, CO, United States

Data Analyst II

@ Postman | Bengaluru, India

Data Architect

@ FORSEVEN | Warwick, GB

Director, Data Science

@ Visa | Washington, DC, United States

Senior Manager, Data Science - Emerging ML

@ Capital One | McLean, VA