Gaga: Group Any Gaussians via 3D-aware Memory Bank | allainews.com

April 12, 2024, 4:46 a.m. | Weijie Lyu, Xueting Li, Abhijit Kundu, Yi-Hsuan Tsai, Ming-Hsuan Yang

cs.CV updates on arXiv.org arxiv.org

arXiv:2404.07977v1 Announce Type: new
Abstract: We introduce Gaga, a framework that reconstructs and segments open-world 3D scenes by leveraging inconsistent 2D masks predicted by zero-shot segmentation models. Contrasted to prior 3D scene segmentation approaches that heavily rely on video object tracking, Gaga utilizes spatial information and effectively associates object masks across diverse camera poses. By eliminating the assumption of continuous view changes in training images, Gaga demonstrates robustness to variations in camera poses, particularly beneficial for sparsely sampled images, ensuring …

3d scenes abstract arxiv bank cs.cv diverse framework information masks memory object open-world prior segmentation spatial tracking type via video world zero-shot

More from arxiv.org / cs.CV updates on arXiv.org

A survey on deep learning in medical image registration: new technologies, uncertainty, evaluation metrics, and … 22 hours ago | arxiv.org

abstract arxiv beyond cs.cv +16

Enhancing Super-Resolution Networks through Realistic Thick-Slice CT Simulation 22 hours ago | arxiv.org

abstract acquisition arxiv cs.ai +20

TransRUPNet for Improved Polyp Segmentation 22 hours ago | arxiv.org

arxiv cs.cv eess.iv segmentation +1

An interpretable machine learning system for colorectal cancer diagnosis from pathology slides 22 hours ago | arxiv.org

abstract artificial artificial intelligence arxiv +19

Attention is All They Need: Exploring the Media Archaeology of the Computer Vision Research Paper 22 hours ago | arxiv.org

abstract archaeology arxiv attention +22

Refining Remote Photoplethysmography Architectures using CKA and Empirical Methods 22 hours ago | arxiv.org

abstract architecture architectures arxiv +8

Learning to Complement with Multiple Humans 22 hours ago | arxiv.org

abstract adoption arxiv assumptions +12

HiH: A Multi-modal Hierarchy in Hierarchy Network for Unconstrained Gait Recognition 22 hours ago | arxiv.org

abstract advances arxiv challenges +12

Image-Based Virtual Try-On: A Survey 22 hours ago | arxiv.org

arxiv cs.cv image survey +3

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Data Analyst (Digital Business Analyst)

@ Activate Interactive Pte Ltd | Singapore, Central Singapore, Singapore

View on ai-jobs.net