Deep Learning for Visual Speech Analysis: A Survey

March 15, 2024, 4:46 a.m. | Changchong Sheng, Gangyao Kuang, Liang Bai, Chenping Hou, Yulan Guo, Xin Xu, Matti Pietikäinen, Li Liu

cs.CV updates on arXiv.org arxiv.org

arXiv:2205.10839v2 Announce Type: replace
Abstract: Visual speech, referring to the visual domain of speech, has attracted increasing attention due to its wide applications, such as public security, medical treatment, military defense, and film entertainment. As a powerful AI strategy, deep learning techniques have extensively promoted the development of visual speech learning. Over the past five years, numerous deep learning based methods have been proposed to address various problems in this area, especially automatic visual speech recognition and generation. To push …


