MARLIN: Masked Autoencoder for facial video Representation LearnINg. (arXiv:2211.06627v1 [cs.CV])
cs.CV updates on arXiv.org
This paper proposes a self-supervised approach to learn universal facial
representations from videos that can transfer across a variety of facial
analysis tasks such as Facial Attribute Recognition (FAR), Facial Expression
Recognition (FER), DeepFake Detection (DFD), and Lip Synchronization (LS). Our
proposed framework, named MARLIN, is a facial video masked autoencoder that
learns highly robust and generic facial embeddings from abundantly available
non-annotated, web-crawled facial videos. As a challenging auxiliary task,
MARLIN reconstructs the spatio-temporal details of the face …
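
The masked-reconstruction idea mentioned above can be illustrated with a small NumPy sketch of a generic video masked-autoencoder objective. This is not MARLIN's implementation: the excerpt does not state the masking strategy, patch size, mask ratio, or loss, so uniform random masking, 2x16x16 cubes, a 0.9 mask ratio, and mean squared error are all assumptions here, and the function names (patchify, random_mask, masked_mse) are hypothetical. A trivial mean predictor stands in for the encoder and decoder.

```python
import numpy as np

def patchify(video, t=2, p=16):
    """Split a (T, H, W, C) clip into flattened spatio-temporal cubes of size t x p x p."""
    T, H, W, C = video.shape
    assert T % t == 0 and H % p == 0 and W % p == 0
    x = video.reshape(T // t, t, H // p, p, W // p, p, C)
    x = x.transpose(0, 2, 4, 1, 3, 5, 6)  # (nT, nH, nW, t, p, p, C)
    return x.reshape(-1, t * p * p * C)

def random_mask(num_patches, mask_ratio, rng):
    """Boolean mask over patches; True means hidden from the encoder (assumed uniform random)."""
    num_masked = int(round(num_patches * mask_ratio))
    mask = np.zeros(num_patches, dtype=bool)
    mask[rng.choice(num_patches, size=num_masked, replace=False)] = True
    return mask

def masked_mse(pred, target, mask):
    """Mean squared error computed over the masked patches only."""
    return float(np.mean((pred[mask] - target[mask]) ** 2))

rng = np.random.default_rng(0)
clip = rng.random((16, 224, 224, 3), dtype=np.float32)  # stand-in for a face clip
patches = patchify(clip)                                 # one row per spatio-temporal cube
mask = random_mask(len(patches), mask_ratio=0.9, rng=rng)
visible = patches[~mask]                                 # what an encoder would see

# Placeholder "model": predict every patch as the mean of the visible ones.
pred = np.broadcast_to(visible.mean(axis=0), patches.shape)
loss = masked_mse(pred, patches, mask)
```

In a real system the placeholder predictor would be replaced by a learned encoder over the visible cubes and a decoder that fills in the hidden ones; the loss is restricted to hidden cubes so the model cannot score well by copying its input.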