all AI news
CGAR: Critic Guided Action Redistribution in Reinforcement Leaning. (arXiv:2206.11494v1 [cs.LG])
Web: http://arxiv.org/abs/2206.11494
June 24, 2022, 1:10 a.m. | Tairan Huang, Xu Li, Hao Li, Mingming Sun, Ping Li
cs.LG updates on arXiv.org arxiv.org
Training a game-playing reinforcement learning agent requires multiple
interactions with the environment. Ignorant random exploration may cause a
waste of time and resources. It's essential to alleviate such waste. As
discussed in this paper, under the settings of the off-policy actor critic
algorithms, we demonstrate that the critic can bring more expected discounted
rewards than or at least equal to the actor. Thus, the Q value predicted by the
critic is a better signal to redistribute the action originally sampled …
More from arxiv.org / cs.LG updates on arXiv.org
Latest AI/ML/Big Data Jobs
Machine Learning Researcher - Saalfeld Lab
@ Howard Hughes Medical Institute - Chevy Chase, MD | Ashburn, Virginia
Project Director, Machine Learning in US Health
@ ideas42.org | Remote, US
Data Science Intern
@ NannyML | Remote
Machine Learning Engineer NLP/Speech
@ Play.ht | Remote
Research Scientist, 3D Reconstruction
@ Yembo | Remote, US
Clinical Assistant or Associate Professor of Management Science and Systems
@ University at Buffalo | Buffalo, NY