Feb. 9, 2024, 5:46 a.m. | Wamiq Reyaz Para Abdelrahman Eldesokey Zhenyu Li Pradyumna Reddy Jiankang Deng Peter Wonka

cs.CV updates on arXiv.org arxiv.org

We introduce an approach for 3D head avatar generation and editing with multi-modal conditioning based on a 3D Generative Adversarial Network (GAN) and a Latent Diffusion Model (LDM). 3D GANs can generate high-quality head avatars given a single or no condition. However, it is challenging to generate samples that adhere to multiple conditions of different modalities. On the other hand, LDMs excel at learning complex conditional distributions. To this end, we propose to exploit the conditioning capabilities of LDMs to …

adversarial avatar avatars cs.cv cs.gr diffusion diffusion model editing gan gans generate generative generative adversarial network head ldm modal multi-modal network quality samples

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

AI Engineering Manager

@ M47 Labs | Barcelona, Catalunya [Cataluña], Spain