all AI news
Evolutionary Action Selection for Gradient-based Policy Learning. (arXiv:2201.04286v4 [cs.NE] UPDATED)
Sept. 19, 2022, 1:12 a.m. | Yan Ma, Tianxing Liu, Bingsheng Wei, Yi Liu, Kang Xu, Wei Li
cs.LG updates on arXiv.org arxiv.org
Evolutionary Algorithms (EAs) and Deep Reinforcement Learning (DRL) have
recently been integrated to take the advantage of the both methods for better
exploration and exploitation.The evolutionary part in these hybrid methods
maintains a population of policy networks.However, existing methods focus on
optimizing the parameters of policy network, which is usually high-dimensional
and tricky for EA.In this paper, we shift the target of evolution from
high-dimensional parameter space to low-dimensional action space.We propose
Evolutionary Action Selection-Twin Delayed Deep Deterministic Policy Gradient …
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Business Intelligence Analyst
@ Rappi | COL-Bogotá
Applied Scientist II
@ Microsoft | Redmond, Washington, United States