May 27, 2024, 4:44 a.m. | Ruijie Zheng, Xiyao Wang, Yanchao Sun, Shuang Ma, Jieyu Zhao, Huazhe Xu, Hal Daumé III, Furong Huang

cs.LG updates on arXiv.org arxiv.org

arXiv:2306.13229v3 Announce Type: replace
Abstract: Despite recent progress in reinforcement learning (RL) from raw pixel data, sample inefficiency continues to present a substantial obstacle. Prior works have attempted to address this challenge by creating self-supervised auxiliary tasks, aiming to enrich the agent's learned representations with control-relevant information for future state prediction. However, these objectives are often insufficient to learn representations that can represent the optimal policy or value function, and they often consider tasks with small, abstract discrete action spaces …
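To make the kind of prior objective the abstract critiques concrete, below is a minimal sketch of a self-supervised auxiliary task for future state prediction: an encoder plus a latent forward-dynamics head trained with a stop-gradient regression target. This is an illustrative, simplified setup (low-dimensional state vectors rather than raw pixels, and class/function names such as LatentForwardModel are hypothetical), not the method proposed in the paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LatentForwardModel(nn.Module):
    """Encoder with a latent forward-prediction head, a common auxiliary objective
    used to inject control-relevant information into learned representations."""

    def __init__(self, obs_dim: int, action_dim: int, latent_dim: int = 64):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Linear(obs_dim, 128), nn.ReLU(),
            nn.Linear(128, latent_dim),
        )
        self.dynamics = nn.Sequential(
            nn.Linear(latent_dim + action_dim, 128), nn.ReLU(),
            nn.Linear(128, latent_dim),
        )

    def aux_loss(self, obs, action, next_obs):
        # Encode current and next observations; the next-state encoding is
        # detached so it acts as a fixed regression target (stop-gradient).
        z = self.encoder(obs)
        z_next_target = self.encoder(next_obs).detach()
        # Predict the next latent from the current latent and the action taken.
        z_next_pred = self.dynamics(torch.cat([z, action], dim=-1))
        return F.mse_loss(z_next_pred, z_next_target)

# Example usage with random data, added to the RL loss as an auxiliary term.
model = LatentForwardModel(obs_dim=16, action_dim=4)
obs, action, next_obs = torch.randn(32, 16), torch.randn(32, 4), torch.randn(32, 16)
loss = model.aux_loss(obs, action, next_obs)
loss.backward()
```

As the abstract notes, objectives of this form can fail to capture everything needed to represent the optimal policy or value function, which is the gap the paper targets.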

