Oct. 6, 2022, 1:12 a.m. | David Brandfonbrener, Stephen Tu, Avi Singh, Stefan Welker, Chad Boodoo, Nikolai Matni, Jake Varley

cs.LG updates on arXiv.org arxiv.org

We consider how to most efficiently leverage teleoperator time to collect
data for learning robust image-based value functions and policies for sparse
reward robotic tasks. To accomplish this goal, we modify the process of data
collection to include more than just successful demonstrations of the desired
task. Instead we develop a novel protocol that we call Visual Backtracking
Teleoperation (VBT), which deliberately collects a dataset of visually similar
failures, recoveries, and successes. VBT data collection is particularly useful
for efficiently …

arxiv backtracking collection data data collection image offline protocol reinforcement reinforcement learning

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Data Science Analyst

@ Mayo Clinic | AZ, United States

Sr. Data Scientist (Network Engineering)

@ SpaceX | Redmond, WA