Learning Separable Hidden Unit Contributions for Speaker-Adaptive Lip-Reading | allainews.com

May 1, 2024, 4:46 a.m. | Songtao Luo, Shuang Yang, Shiguang Shan, Xilin Chen

cs.CV updates on arXiv.org arxiv.org

arXiv:2310.05058v3 Announce Type: replace
Abstract: In this paper, we propose a novel method for speaker adaptation in lip reading, motivated by two observations. Firstly, a speaker's own characteristics can always be portrayed well by his/her few facial images or even a single image with shallow networks, while the fine-grained dynamic features associated with speech content expressed by the talking face always need deep sequential networks to represent accurately. Therefore, we treat the shallow and deep layers differently for speaker adaptive …

abstract arxiv cs.ai cs.cv cs.sd dynamic eess.as features fine-grained her hidden image images lip reading networks novel paper reading speaker type while

More from arxiv.org / cs.CV updates on arXiv.org

DisBeaNet: A Deep Neural Network to augment Unmanned Surface Vessels for maritime situational awareness 17 hours ago | arxiv.org

abstract arxiv augment automated +18

Contextual Embedding Learning to Enhance 2D Networks for Volumetric Image Segmentation 17 hours ago | arxiv.org

arxiv cs.cv eess.iv embedding +5

KI-PMF: Knowledge Integrated Plausible Motion Forecasting 17 hours ago | arxiv.org

abstract actors arxiv autonomous +18

Adaptive Landmark Color for AUV Docking in Visually Dynamic Environments 17 hours ago | arxiv.org

abstract arxiv autonomous batteries +15

OccupancyDETR: Using DETR for Mixed Dense-sparse 3D Occupancy Prediction 17 hours ago | arxiv.org

abstract arxiv autonomous autonomous vehicles +21

Multimodal Chain-of-Thought Reasoning in Language Models 17 hours ago | arxiv.org

arxiv cs.ai cs.cl cs.cv +7

Robust Self-Tuning Data Association for Geo-Referencing Using Lane Markings 17 hours ago | arxiv.org

abstract advantages aerial arxiv +16

Surrogate-based cross-correlation for particle image velocimetry 17 hours ago | arxiv.org

arxiv correlation cs.cv eess.iv +5

Advancing Human Action Recognition with Foundation Models trained on Unlabeled Public Videos 17 hours ago | arxiv.org

abstract action action recognition advance +20

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

.NET Software Engineer (AI Focus)

@ Boskalis | Papendrecht, Netherlands

View on ai-jobs.net