March 28, 2024, 4:45 a.m. | Valay Bundele, Mahesh Bhupati, Biplab Banerjee, Aditya Grover

cs.CV updates on arXiv.org arxiv.org

arXiv:2403.18454v1 Announce Type: new
Abstract: The study of vision-and-language navigation (VLN) has typically relied on expert trajectories, which may not always be available in real-world situations due to the significant effort required to collect them. On the other hand, existing approaches to training VLN agents that go beyond available expert data involve data augmentations or online exploration which can be tedious and risky. In contrast, it is easy to access large repositories of suboptimal offline trajectories. Inspired by research in …

abstract agents arxiv beyond cs.cv data expert language navigation offline scaling study them training type vision vision-and-language world

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Principal Data Engineering Manager

@ Microsoft | Redmond, Washington, United States

Machine Learning Engineer

@ Apple | San Diego, California, United States