March 21, 2024, 4:42 a.m. | Dipendra Misra, Akanksha Saran, Tengyang Xie, Alex Lamb, John Langford

cs.LG updates on arXiv.org arxiv.org

arXiv:2403.13765v1 Announce Type: new
Abstract: We study pre-training representations for decision-making using video data, which is abundantly available for tasks such as game agents and software testing. Even though significant empirical advances have been made on this problem, a theoretical understanding remains absent. We initiate the theoretical investigation into principled approaches for representation learning and focus on learning the latent state representations of the underlying MDP using video data. We study two types of settings: one where there is iid …

abstract advances agents arxiv cs.ai cs.cv cs.lg data decision game investigation making pre-training reinforcement reinforcement learning representation representation learning software software testing study tasks testing training type understanding video video data videos

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US