March 26, 2024, 4:42 a.m. | Samuel Chun-Hei Lam, Justin Sirignano, Ziheng Wang

cs.LG updates on arXiv.org

arXiv:2403.16825v1 Announce Type: new
Abstract: We prove that a single-layer neural network trained with the online actor-critic algorithm converges in distribution to a random ordinary differential equation (ODE) as the number of hidden units and the number of training steps $\rightarrow \infty$. In the online actor-critic algorithm, the distribution of the data samples dynamically changes as the model is updated, which is a key challenge for any convergence analysis. We establish the geometric ergodicity of the data samples under …

