Deep Learning for Visual Speech Analysis: A Survey | allainews.com

March 15, 2024, 4:46 a.m. | Changchong Sheng, Gangyao Kuang, Liang Bai, Chenping Hou, Yulan Guo, Xin Xu, Matti Pietik\"ainen, Li Liu

cs.CV updates on arXiv.org arxiv.org

arXiv:2205.10839v2 Announce Type: replace
Abstract: Visual speech, referring to the visual domain of speech, has attracted increasing attention due to its wide applications, such as public security, medical treatment, military defense, and film entertainment. As a powerful AI strategy, deep learning techniques have extensively promoted the development of visual speech learning. Over the past five years, numerous deep learning based methods have been proposed to address various problems in this area, especially automatic visual speech recognition and generation. To push …

abstract ai strategy analysis applications arxiv attention cs.cv deep learning deep learning techniques defense development domain entertainment film medical military promoted public security speech speech analysis strategy survey treatment type visual

More from arxiv.org / cs.CV updates on arXiv.org

Physics-Informed Computer Vision: A Review and Perspectives 5 hours ago | arxiv.org

abstract application arxiv computer +26

Boosting Visual Recognition in Real-world Degradations via Unsupervised Feature Enhancement Module with Deep Channel Prior 5 hours ago | arxiv.org

arxiv boosting cs.cv feature +8

Analyzing and Mitigating Bias for Vulnerable Classes: Towards Balanced Representation in Dataset 5 hours ago | arxiv.org

abstract accuracy arxiv autonomous +23

GPT4Ego: Unleashing the Potential of Pre-trained Models for Zero-Shot Egocentric Action Recognition 5 hours ago | arxiv.org

abstract action recognition advancement arxiv +23

Revisiting Sampson Approximations for Geometric Estimation Problems 5 hours ago | arxiv.org

abstract arxiv collection computer +8

Frequency-Time Diffusion with Neural Cellular Automata 5 hours ago | arxiv.org

abstract arxiv capabilities cellular +16

A Comprehensive Overview of Fish-Eye Camera Distortion Correction Methods 5 hours ago | arxiv.org

abstract applications arxiv cameras +13

Adaptive Depth Networks with Skippable Sub-Paths 5 hours ago | arxiv.org

abstract arxiv control cs.ai +11

Attention-aware Social Graph Transformer Networks for Stochastic Trajectory Prediction 5 hours ago | arxiv.org

abstract arxiv attention autonomous +26

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net