all AI news
Less is More: Consistent Video Depth Estimation with Masked Frames Modeling. (arXiv:2208.00380v2 [cs.CV] UPDATED)
Aug. 19, 2022, 1:12 a.m. | Yiran Wang, Zhiyu Pan, Xingyi Li, Zhiguo Cao, Ke Xian, Jianming Zhang
cs.CV updates on arXiv.org arxiv.org
Temporal consistency is the key challenge of video depth estimation. Previous
works are based on additional optical flow or camera poses, which is
time-consuming. By contrast, we derive consistency with less information. Since
videos inherently exist with heavy temporal redundancy, a missing frame could
be recovered from neighboring ones. Inspired by this, we propose the frame
masking network (FMNet), a spatial-temporal transformer network predicting the
depth of masked frames based on their neighboring frames. By reconstructing
masked temporal features, the …
More from arxiv.org / cs.CV updates on arXiv.org
Jobs in AI, ML, Big Data
Data Scientist (m/f/x/d)
@ Symanto Research GmbH & Co. KG | Spain, Germany
Data Engineer
@ Paxos | Remote - United States
Data Analytics Specialist
@ Media.Monks | Kuala Lumpur
Software Engineer III- Pyspark
@ JPMorgan Chase & Co. | India
Engineering Manager, Data Infrastructure
@ Dropbox | Remote - Canada
Senior AI NLP Engineer
@ Hyro | Tel Aviv-Yafo, Tel Aviv District, Israel