Policy Gradient Methods Find the Nash Equilibrium in N-player General-sum Linear-quadratic Games | allainews.com

Jan. 1, 2023, midnight | Ben Hambly, Renyuan Xu, Huining Yang

JMLR www.jmlr.org

We consider a general-sum N-player linear-quadratic game with stochastic dynamics over a finite horizon and prove the global convergence of the natural policy gradient method to the Nash equilibrium. In order to prove convergence of the method we require a certain amount of noise in the system. We give a condition, essentially a lower bound on the covariance of the noise in terms of the model parameters, in order to guarantee convergence. We illustrate our results with numerical experiments to …

convergence dynamics equilibrium game games general global gradient linear nash equilibrium natural noise policy stochastic

More from www.jmlr.org / JMLR

Deep Network Approximation: Beyond ReLU to Diverse Activation Functions 4 months, 1 week ago | www.jmlr.org

approximation beyond diverse function +10

Model-Free Representation Learning and Exploration in Low-Rank MDPs 4 months, 1 week ago | www.jmlr.org

algorithms contrast dynamics exploration +9

Effect-Invariant Mechanisms for Policy Generalization 4 months, 1 week ago | www.jmlr.org

adapt challenge environments exploit +7

Pygmtools: A Python Graph Matching Toolkit 4 months, 1 week ago | www.jmlr.org

applications collection free graph +8

Power of knockoff: The impact of ranking algorithm, augmented design, and symmetric statistic 4 months, 1 week ago | www.jmlr.org

algorithm components control design +11

Heterogeneous-Agent Reinforcement Learning 4 months, 1 week ago | www.jmlr.org

agent agents ai research convergence +10

Sample-efficient Adversarial Imitation Learning 4 months, 1 week ago | www.jmlr.org

advanced adversarial behavior decision +13

Stochastic Modified Flows, Mean-Field Limits and Dynamics of Stochastic Gradient Descent 4 months, 1 week ago | www.jmlr.org

diffusion dynamics gradient mean +4

Rates of convergence for density estimation with generative adversarial networks 4 months, 1 week ago | www.jmlr.org

adversarial convergence divergence error +11

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net