April 4, 2024, 4:42 a.m. | Rudra P. K. Poudel, Harit Pandya, Stephan Liwicki, Roberto Cipolla

cs.LG updates on arXiv.org

arXiv:2312.09056v2 Announce Type: replace
Abstract: While recent model-free Reinforcement Learning (RL) methods have demonstrated human-level effectiveness in gaming environments, their success in everyday tasks like visual navigation has been limited, particularly under significant appearance variations. This limitation arises from (i) poor sample efficiency and (ii) over-fitting to training scenarios. To address these challenges, we present a world model that learns invariant features using (i) contrastive unsupervised learning and (ii) an intervention-invariant regularizer. Learning an explicit representation of the world dynamics …
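
Contrastive unsupervised learning, as named in the abstract, is often realized with an InfoNCE-style objective: embeddings of two views of the same observation are pulled together, and embeddings of different observations are pushed apart. The following is a minimal, generic sketch of that objective in plain Python. It is not the authors' implementation; the function names, the cosine similarity measure, and the `temperature` value are assumptions chosen for illustration.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce_loss(anchors, positives, temperature=0.1):
    """Generic InfoNCE loss (illustrative, hypothetical helper).

    anchors[i] and positives[i] are embeddings of two views of the same
    observation; positives[j] for j != i act as negatives for anchors[i].
    """
    n = len(anchors)
    total = 0.0
    for i in range(n):
        # Similarity of anchor i to every candidate, scaled by temperature.
        logits = [
            cosine_similarity(anchors[i], positives[j]) / temperature
            for j in range(n)
        ]
        # Numerically stable log-sum-exp over all candidates.
        m = max(logits)
        log_sum_exp = m + math.log(sum(math.exp(l - m) for l in logits))
        # Cross-entropy with the matching pair (index i) as the target.
        total += log_sum_exp - logits[i]
    return total / n


# Toy embeddings: correctly paired views versus deliberately swapped views.
aligned = info_nce_loss([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
swapped = info_nce_loss([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
```

In this toy setup the loss is small when each anchor is most similar to its own positive and large when the pairs are swapped, which is the signal an encoder would be trained to minimize.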
