April 4, 2024, 4:42 a.m. | Rudra P. K. Poudel, Harit Pandya, Stephan Liwicki, Roberto Cipolla

cs.LG updates on arXiv.org arxiv.org

arXiv:2312.09056v2 Announce Type: replace
Abstract: While recent model-free Reinforcement Learning (RL) methods have demonstrated human-level effectiveness in gaming environments, their success in everyday tasks like visual navigation has been limited, particularly under significant appearance variations. This limitation arises from (i) poor sample efficiency and (ii) over-fitting to training scenarios. To address these challenges, we present a world model that learns invariant features using (i) contrastive unsupervised learning and (ii) an intervention-invariant regularizer. Learning an explicit representation of the world dynamics …

abstract arxiv cs.ai cs.cv cs.lg cs.ro efficiency environments free gaming human navigation reinforcement reinforcement learning representation representation learning sample stat.ml success tasks training type visual visual navigation world world model

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Data Engineer

@ Quantexa | Sydney, New South Wales, Australia

Staff Analytics Engineer

@ Warner Bros. Discovery | NY New York 230 Park Avenue South