Oct. 24, 2022, 1:11 a.m. | Elias Hanna, Alex Coninx, Stéphane Doncieux

cs.LG updates on arXiv.org arxiv.org

This paper studies the impact of the initial data gathering method on the
subsequent learning of a dynamics model. Dynamics models approximate the true
transition function of a given task, in order to perform policy search directly
on the model rather than on the costly real system. This study aims to
determine how to bootstrap a model as efficiently as possible, by comparing
initialization methods employed in two different policy search frameworks in
the literature. The study focuses on the …

arxiv bootstrapping policy random search

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne