Feb. 28, 2024, 5:46 a.m. | Xiaohan Lei, Min Wang, Wengang Zhou, Li Li, Houqiang Li

cs.CV updates on arXiv.org arxiv.org

arXiv:2402.17587v1 Announce Type: new
Abstract: As a new embodied vision task, Instance ImageGoal Navigation (IIN) aims to navigate to a specified object depicted by a goal image in an unexplored environment.
The main challenge of this task lies in identifying the target object from different viewpoints while rejecting similar distractors.
Existing ImageGoal Navigation methods usually adopt the simple Exploration-Exploitation framework and ignore the identification of specific instance during navigation.
In this work, we propose to imitate the human behaviour of …

abstract arxiv challenge cs.cv cs.ro embodied environment exploitation exploration image instance lies navigation type verification vision

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne