April 15, 2022, 1:12 a.m. | Sina Honari, Victor Constantin, Helge Rhodin, Mathieu Salzmann, Pascal Fua

cs.LG updates on arXiv.org arxiv.org

In this paper we propose an unsupervised learning method to extract temporal
information on monocular videos, where we detect and encode subject of interest
in each frame and leverage contrastive self-supervised (CSS) learning to
extract rich latent vectors. Instead of simply treating the latent features of
nearby frames as positive pairs and those of temporally-distant ones as
negative pairs as in other CSS approaches, we explicitly disentangle each
latent vector into a time-variant component and a time-invariant one. We then …

3d arxiv cv human learning unsupervised videos

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Machine Learning Engineer (m/f/d)

@ StepStone Group | Düsseldorf, Germany

2024 GDIA AI/ML Scientist - Supplemental

@ Ford Motor Company | United States