Nov. 22, 2022, 2:12 a.m. | Shailza Sharma, Abhinav Dhall, Vinay Kumar, Vivek Singh Bawa

cs.CV updates on arXiv.org arxiv.org

Recently, there has been numerous breakthroughs in face hallucination tasks.
However, the task remains rather challenging in videos in comparison to the
images due to inherent consistency issues. The presence of extra temporal
dimension in video face hallucination makes it non-trivial to learn the facial
motion through out the sequence. In order to learn these fine spatio-temporal
motion details, we propose a novel cross-modal audio-visual Video Face
Hallucination Generative Adversarial Network (VFH-GAN). The architecture
exploits the semantic correlation of between …

arxiv audio face lip reading loss reading speech support video

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US