Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations. (arXiv:2301.02184v2 [cs.CV] UPDATED)
Source: cs.LG updates on arXiv.org (arxiv.org)
Can conversational videos captured from multiple egocentric viewpoints reveal
the map of a scene in a cost-efficient way? We seek to answer this question by
proposing a new problem: efficiently building the map of a previously unseen 3D
environment by exploiting shared information in the egocentric audio-visual
observations of participants in a natural conversation. Our hypothesis is that
as multiple people ("egos") move in a scene and talk among themselves, they
receive rich audio-visual cues that can help uncover the …