Aug. 16, 2022, 1:13 a.m. | Miao Liu, Lingni Ma, Kiran Somasundaram, Yin Li, Kristen Grauman, James M. Rehg, Chao Li

cs.CV updates on arXiv.org arxiv.org

Given a video captured from a first-person perspective and the environment
context of where the video is recorded, can we recognize what the person is
doing and identify where the action occurs in the 3D space? We address this
challenging problem of jointly recognizing and localizing actions of a mobile
user on a known 3D map from egocentric videos. To this end, we propose a novel
deep probabilistic model. Our model takes the inputs of a Hierarchical
Volumetric Representation …

