Oct. 19, 2022, 1:13 a.m. | Xuyang Chen, Lin Zhao

stat.ML updates on arXiv.org arxiv.org

Despite the great empirical success of actor-critic methods, its finite-time
convergence is still poorly understood in its most practical form. In
particular, the analysis of single-timescale actor-critic presents significant
challenges due to the highly inaccurate critic estimation and the complex error
propagation dynamics over iterations. Existing works on analyzing
single-timescale actor-critic only focus on the i.i.d. sampling or tabular
setting for simplicity, which is rarely the case in practical applications. We
consider the more practical online single-timescale actor-critic algorithm on …

actor-critic analysis arxiv

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Principal, Product Strategy Operations, Cloud Data Analytics

@ Google | Sunnyvale, CA, USA; Austin, TX, USA

Data Scientist - HR BU

@ ServiceNow | Hyderabad, India