March 26, 2024, 4:42 a.m. | Samuel Chun-Hei Lam, Justin Sirignano, Ziheng Wang

cs.LG updates on arXiv.org arxiv.org

arXiv:2403.16825v1 Announce Type: new
Abstract: We prove that a single-layer neural network trained with the online actor critic algorithm converges in distribution to a random ordinary differential equation (ODE) as the number of hidden units and the number of training steps $\rightarrow \infty$. In the online actor-critic algorithm, the distribution of the data samples dynamically changes as the model is updated, which is a key challenge for any convergence analysis. We establish the geometric ergodicity of the data samples under …

abstract actor actor-critic algorithm algorithms analysis arxiv convergence cs.lg differential differential equation distribution equation hidden layer math.oc math.pr network neural network ordinary prove random stat.ml training type units

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Principal, Product Strategy Operations, Cloud Data Analytics

@ Google | Sunnyvale, CA, USA; Austin, TX, USA

Data Scientist - HR BU

@ ServiceNow | Hyderabad, India