Web: http://arxiv.org/abs/2205.13746

Sept. 23, 2022, 1:12 a.m. | Sihan Zeng, Thinh T. Doan, Justin Romberg


We study the problem of finding the Nash equilibrium in a two-player zero-sum
Markov game. Because the problem is formulated as a minimax optimization
program, a natural approach to solving it is to perform gradient descent/ascent
with respect to each player in an alternating fashion. However, due to the
non-convexity/non-concavity of the underlying objective function, theoretical
understanding of this method is limited. In our paper, we consider solving an
entropy-regularized variant of the Markov game. The regularization introduces
structure into …
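
To make the alternating descent/ascent idea concrete, here is a minimal sketch on a much simpler problem than the one the abstract studies: a single-state zero-sum matrix game with entropy regularization. Everything in it is an assumption made for illustration and is not taken from the paper: the rock-paper-scissors payoff matrix, the function name `alternating_regularized_updates`, the values of `tau`, `eta`, and `steps`, and the use of multiplicative-weights (entropic mirror descent) steps in place of plain gradient steps. The regularized objective assumed here is f(x, y) = x^T A y + tau * sum(x log x) - tau * sum(y log y), with x minimizing and y maximizing.

```python
import numpy as np


def softmax(z):
    """Numerically stable softmax mapping logits to a probability vector."""
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()


def alternating_regularized_updates(A, theta0, phi0, tau=1.0, eta=0.1, steps=2000):
    """Alternating entropy-regularized multiplicative-weights updates.

    Illustrative assumption, not the paper's algorithm. The min player holds
    logits theta (policy x = softmax(theta)); the max player holds logits phi
    (policy y = softmax(phi)). In logit form, the entropic mirror step
    x' ∝ x^(1 - eta*tau) * exp(-eta * A y) becomes a linear update on theta.
    """
    theta = np.asarray(theta0, dtype=float).copy()
    phi = np.asarray(phi0, dtype=float).copy()
    for _ in range(steps):
        # Descent step for the min player against the current y.
        y = softmax(phi)
        theta = (1.0 - eta * tau) * theta - eta * (A @ y)
        # Ascent step for the max player against the freshly updated x.
        x = softmax(theta)
        phi = (1.0 - eta * tau) * phi + eta * (A.T @ x)
    return softmax(theta), softmax(phi)


# Rock-paper-scissors payoff (hypothetical example game); entry A[i, j] is the
# amount the min player pays when it plays i and the max player plays j.
RPS = np.array([[0.0, -1.0, 1.0],
                [1.0, 0.0, -1.0],
                [-1.0, 1.0, 0.0]])

x, y = alternating_regularized_updates(RPS, theta0=[1.0, 0.0, -1.0], phi0=[-0.5, 0.5, 0.0])
print("min player policy:", x)
print("max player policy:", y)
```

By symmetry, the regularized equilibrium of this particular game is the uniform policy for both players, so both printed vectors should be close to uniform. With `tau = 1.0` the shrinkage term `(1 - eta * tau)` dominates the payoff term for this matrix, which is why the iteration contracts from any starting logits. A Markov game adds state transitions and value functions that this sketch does not model.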


