all AI news
CACTO: Continuous Actor-Critic with Trajectory Optimization -- Towards global optimality. (arXiv:2211.06625v1 [cs.RO])
Nov. 15, 2022, 2:11 a.m. | Gianluigi Grandesso, Gastone P. Rosati Papini, Patrick M. Wensing, Andrea Del Prete
cs.LG updates on arXiv.org arxiv.org
This paper presents a novel algorithm for the continuous control of dynamical
systems that combines Trajectory Optimization (TO) and Reinforcement Learning
(RL) in a single framework. The motivations behind this algorithm are the two
main limitations of TO and RL when applied to continuous nonlinear systems to
minimize a non-convex cost function. Specifically, TO can get stuck in poor
local minima when the search is not initialized close to a ``good'' minimum. On
the other hand, when dealing with continuous …
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Senior Principal, Product Strategy Operations, Cloud Data Analytics
@ Google | Sunnyvale, CA, USA; Austin, TX, USA
Data Scientist - HR BU
@ ServiceNow | Hyderabad, India