On the Convergence of Policy in Unregularized Policy Mirror Descent. (arXiv:2205.08176v2 [math.OC] UPDATED)
May 20, 2022, 1:12 a.m. | Dachao Lin, Zhihua Zhang
cs.LG updates on arXiv.org arxiv.org
In this short note, we analyze the convergence of the policy in the recently
popular policy mirror descent (PMD). We mainly consider the unregularized
setting following [11] with generalized Bregman divergences; the difference is
that we directly derive convergence rates for the policy itself under a
generalized Bregman divergence. Our results are inspired by the convergence of
the value function in previous works and extend the study of policy mirror
descent. Though some results have already appeared in previous work, we …
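For readers unfamiliar with the algorithm the abstract refers to: a PMD step maximizes the current Q-values minus a Bregman-divergence penalty to the current policy. With the KL divergence (negative-entropy mirror map) this reduces to a multiplicative-weights update. Below is a minimal tabular sketch of that special case; the toy 2-state, 2-action MDP, the step size, and the iteration count are illustrative assumptions, not taken from the paper.

```python
import numpy as np

# Hypothetical 2-state, 2-action MDP (illustrative only).
# P[s, a, s'] = transition probability, r[s, a] = reward.
P = np.array([[[0.9, 0.1], [0.1, 0.9]],
              [[0.8, 0.2], [0.3, 0.7]]])
r = np.array([[1.0, 0.0],
              [0.0, 1.0]])
gamma = 0.9

def q_values(pi):
    """Exact policy evaluation: Q^pi for a tabular policy pi[s, a]."""
    r_pi = (pi * r).sum(axis=1)                      # expected reward per state
    P_pi = np.einsum('sa,sat->st', pi, P)            # state transition matrix under pi
    V = np.linalg.solve(np.eye(len(r_pi)) - gamma * P_pi, r_pi)
    return r + gamma * P @ V                          # Q[s, a]

def pmd_step(pi, eta):
    """One unregularized PMD step with KL mirror map:
    pi_{k+1}(a|s) proportional to pi_k(a|s) * exp(eta * Q^{pi_k}(s, a))."""
    new = pi * np.exp(eta * q_values(pi))
    return new / new.sum(axis=1, keepdims=True)

pi = np.full((2, 2), 0.5)                             # start from the uniform policy
for _ in range(200):
    pi = pmd_step(pi, eta=1.0)
```

In this toy MDP the greedy policy (action 0 in state 0, action 1 in state 1) earns reward 1 at every step, so the iterates should concentrate on it; the note's focus is the rate at which such policy iterates converge under general Bregman divergences, not just the KL case sketched here.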
Jobs in AI, ML, Big Data
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Data Analyst (CPS-GfK)
@ GfK | Bucharest
Consultant Data Analytics IT Digital Impulse - H/F
@ Talan | Paris, France
Data Analyst
@ Experian | Mumbai, India
Data Scientist
@ Novo Nordisk | Princeton, NJ, US
Data Architect IV
@ Millennium Corporation | United States