Finite Horizon Q-learning: Stability, Convergence, Simulations and an application on Smart Grids. (arXiv:2110.15093v2 [cs.LG] UPDATED)
Web: http://arxiv.org/abs/2110.15093
May 4, 2022, 1:12 a.m. | Vivek VP, Dr. Shalabh Bhatnagar
cs.LG updates on arXiv.org
Q-learning is a popular reinforcement learning algorithm. However, it has
been studied and analysed mainly in the infinite-horizon setting, even though
several important applications can be modeled as finite-horizon Markov
decision processes (MDPs). We develop a version of the Q-learning algorithm
for finite-horizon MDPs and provide a full proof of its stability and
convergence. Our analysis of the stability and convergence of finite-horizon
Q-learning is based entirely on the ordinary differential …
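
The truncated abstract does not state the update rule, so the following is only a minimal sketch of the textbook stage-indexed form of tabular Q-learning for a finite horizon, not necessarily the authors' algorithm. It assumes an undiscounted objective, zero terminal value, uniform random exploration, and a 1/n step size per (stage, state, action) entry. The toy MDP (P_TOY, R_TOY) and both function names are hypothetical and chosen purely for illustration. A backward-induction routine is included so the learned values can be compared with the exact ones.

```python
import random

# Hypothetical 2-state, 2-action MDP used only for illustration.
# P_TOY[s][a][s2] is a transition probability, R_TOY[s][a] a reward.
P_TOY = [[[0.9, 0.1], [0.2, 0.8]],
         [[0.5, 0.5], [0.1, 0.9]]]
R_TOY = [[1.0, 0.0],
         [0.0, 0.5]]


def finite_horizon_q_learning(P, R, horizon, episodes=20000, seed=0):
    """Tabular Q-learning with one Q-table per stage h = 0..horizon-1."""
    rng = random.Random(seed)
    n_states, n_actions = len(R), len(R[0])
    # Q[horizon] is never updated and stays zero (assumed zero terminal value).
    Q = [[[0.0] * n_actions for _ in range(n_states)]
         for _ in range(horizon + 1)]
    visits = [[[0] * n_actions for _ in range(n_states)]
              for _ in range(horizon)]
    for _ in range(episodes):
        s = rng.randrange(n_states)          # uniform random start state
        for h in range(horizon):
            a = rng.randrange(n_actions)     # uniform random exploration
            s_next = rng.choices(range(n_states), weights=P[s][a])[0]
            visits[h][s][a] += 1
            alpha = 1.0 / visits[h][s][a]    # assumed 1/n step size
            # Bootstrap from the NEXT stage's table, not the current one.
            target = R[s][a] + max(Q[h + 1][s_next])
            Q[h][s][a] += alpha * (target - Q[h][s][a])
            s = s_next
    return Q[:horizon]


def backward_induction(P, R, horizon):
    """Exact stage-wise optimal Q-values via dynamic programming."""
    n_states, n_actions = len(R), len(R[0])
    Q = [[[0.0] * n_actions for _ in range(n_states)]
         for _ in range(horizon + 1)]
    for h in reversed(range(horizon)):
        v_next = [max(Q[h + 1][s]) for s in range(n_states)]
        for s in range(n_states):
            for a in range(n_actions):
                expected = sum(p * v for p, v in zip(P[s][a], v_next))
                Q[h][s][a] = R[s][a] + expected
    return Q[:horizon]
```

Calling finite_horizon_q_learning(P_TOY, R_TOY, horizon=3) and backward_induction(P_TOY, R_TOY, horizon=3) gives two tables that can be compared entry by entry. The main difference from the infinite-horizon case is that the Q-values, and hence the greedy policy, depend on the stage h.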