Aug. 19, 2022, 1:12 a.m. | Yiran Wang, Zhiyu Pan, Xingyi Li, Zhiguo Cao, Ke Xian, Jianming Zhang

cs.CV updates on arXiv.org arxiv.org

Temporal consistency is the key challenge of video depth estimation. Previous
works are based on additional optical flow or camera poses, which is
time-consuming. By contrast, we derive consistency with less information. Since
videos inherently exist with heavy temporal redundancy, a missing frame could
be recovered from neighboring ones. Inspired by this, we propose the frame
masking network (FMNet), a spatial-temporal transformer network predicting the
depth of masked frames based on their neighboring frames. By reconstructing
masked temporal features, the …

arxiv consistent cv modeling video

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

Data Engineer

@ Paxos | Remote - United States

Data Analytics Specialist

@ Media.Monks | Kuala Lumpur

Software Engineer III- Pyspark

@ JPMorgan Chase & Co. | India

Engineering Manager, Data Infrastructure

@ Dropbox | Remote - Canada

Senior AI NLP Engineer

@ Hyro | Tel Aviv-Yafo, Tel Aviv District, Israel