April 5, 2024, 4:45 a.m. | Hanzhe Hu, Zhizhuo Zhou, Varun Jampani, Shubham Tulsiani

cs.CV updates on arXiv.org arxiv.org

arXiv:2404.03656v1 Announce Type: new
Abstract: We present MVD-Fusion: a method for single-view 3D inference via generative modeling of multi-view-consistent RGB-D images. While recent methods pursuing 3D inference advocate learning novel-view generative models, these generations are not 3D-consistent and require a distillation process to generate a 3D output. We instead cast the task of 3D inference as directly generating mutually-consistent multiple views and build on the insight that additionally inferring depth can provide a mechanism for enforcing this consistency. Specifically, we …

abstract arxiv consistent cs.cv distillation fusion generate generative generative modeling generative models images inference modeling novel process rgb-d type via view

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York