GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting | allainews.com

April 23, 2024, 4:47 a.m. | Hongyun Yu, Zhan Qu, Qihang Yu, Jianchuan Chen, Zhonghua Jiang, Zhiwen Chen, Shengyu Zhang, Jimin Xu, Fei Wu, Chengfei Lv, Gang Yu

cs.CV updates on arXiv.org arxiv.org

arXiv:2404.14037v1 Announce Type: new
Abstract: Recent works on audio-driven talking head synthesis using Neural Radiance Fields (NeRF) have achieved impressive results. However, due to inadequate pose and expression control caused by NeRF implicit representation, these methods still have some limitations, such as unsynchronized or unnatural lip movements, and visual jitter and artifacts. In this paper, we propose GaussianTalker, a novel method for audio-driven talking head synthesis based on 3D Gaussian Splatting. With the explicit representation property of 3D Gaussians, intuitive …

abstract arxiv audio control cs.cv cs.mm fields head however limitations movements nerf neural radiance fields representation results speaker synthesis talking head type via visual

More from arxiv.org / cs.CV updates on arXiv.org

GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration 9 hours ago | arxiv.org

abstract arxiv cs.cl cs.cv +25

Dynamic Open Vocabulary Enhanced Safe-landing with Intelligence (DOVESEI) 9 hours ago | arxiv.org

abstract arxiv attention cs.ai +16

CoVid-19 Detection leveraging Vision Transformers and Explainable AI 9 hours ago | arxiv.org

abstract arxiv covid covid-19 +19

SAR image matching algorithm based on multi-class features 9 hours ago | arxiv.org

abstract algorithm application arxiv +13

Enhancing Sign Language Teaching: A Mixed Reality Approach for Immersive Learning and Multi-Dimensional Feedback 9 hours ago | arxiv.org

abstract arxiv challenges classroom +13

A Linear Time and Space Local Point Cloud Geometry Encoder via Vectorized Kernel Mixture (VecKM) 9 hours ago | arxiv.org

abstract arxiv cloud compute +11

UP-CrackNet: Unsupervised Pixel-Wise Road Crack Detection via Adversarial Image Restoration 9 hours ago | arxiv.org

abstract adversarial algorithms arxiv +21

AttributionScanner: A Visual Analytics System for Model Validation with Metadata-Free Slice Finding 9 hours ago | arxiv.org

abstract analytics arxiv context +19

FurniScene: A Large-scale 3D Room Dataset with Intricate Furnishing Scenes 9 hours ago | arxiv.org

abstract applications arxiv attention +15

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Machine Learning Engineer - Sr. Consultant level

@ Visa | Bellevue, WA, United States

View on ai-jobs.net