June 28, 2024, 4:47 a.m. | Minghan Li, Heng Li, Zhi-Qi Cheng, Yifei Dong, Yuxuan Zhou, Jun-Yan He, Qi Dai, Teruko Mitamura, Alexander G. Hauptmann

cs.CV updates on arXiv.org

arXiv:2406.19236v1 Announce Type: cross
Abstract: Vision-and-Language Navigation (VLN) aims to develop embodied agents that navigate based on human instructions. However, current VLN frameworks often rely on static environments and optimal expert supervision, limiting their real-world applicability. To address this, we introduce Human-Aware Vision-and-Language Navigation (HA-VLN), extending traditional VLN by incorporating dynamic human activities and relaxing key assumptions. We propose the Human-Aware 3D (HA3D) simulator, which combines dynamic human activities with the Matterport3D dataset, and the Human-Aware Room-to-Room (HA-R2R) dataset, extending …
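To make the task setup concrete, here is a minimal, hypothetical sketch of a VLN episode loop: an agent moves between adjacent viewpoints of an indoor scene until it believes the instruction is satisfied. The names below (NavSimulator, run_episode, the toy viewpoint graph, and the random placeholder policy) are illustrative assumptions, not the actual HA3D simulator API.

```python
import random
from dataclasses import dataclass, field


@dataclass
class NavSimulator:
    """Toy stand-in for a navigation simulator: a graph of viewpoints."""
    graph: dict = field(default_factory=lambda: {
        "hall": ["kitchen", "bedroom"],
        "kitchen": ["hall"],
        "bedroom": ["hall"],
    })
    location: str = "hall"

    def neighbors(self) -> list:
        return self.graph[self.location]

    def step(self, target: str) -> str:
        # Agents may only move along edges of the viewpoint graph.
        assert target in self.neighbors(), "can only move to adjacent viewpoints"
        self.location = target
        return self.location


def run_episode(sim: NavSimulator, instruction: str, goal: str, max_steps: int = 10) -> bool:
    """Run one navigation episode; success means stopping at the goal viewpoint."""
    for _ in range(max_steps):
        if sim.location == goal:
            return True
        # A real agent would condition on the instruction and its visual
        # observation; a random neighbor stands in for that policy here.
        sim.step(random.choice(sim.neighbors()))
    return sim.location == goal


if __name__ == "__main__":
    sim = NavSimulator()
    success = run_episode(sim, "Go to the bedroom and stop.", goal="bedroom")
    print("success:", success)
```

In the human-aware setting the paper proposes, the environment would additionally contain moving people, so a policy must avoid them rather than treat the scene as static.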
