Enabling Visual Composition and Animation in Unsupervised Video Generation | allainews.com

March 22, 2024, 4:45 a.m. | Aram Davtyan, Sepehr Sameni, Bj\"orn Ommer, Paolo Favaro

cs.CV updates on arXiv.org arxiv.org

arXiv:2403.14368v1 Announce Type: new
Abstract: In this work we propose a novel method for unsupervised controllable video generation. Once trained on a dataset of unannotated videos, at inference our model is capable of both composing scenes of predefined object parts and animating them in a plausible and controlled way. This is achieved by conditioning video generation on a randomly selected subset of local pre-trained self-supervised features during training. We call our model CAGE for visual Composition and Animation for video …

animation arxiv cs.cv enabling type unsupervised video video generation visual

More from arxiv.org / cs.CV updates on arXiv.org

Gradient-based Local Next-best-view Planning for Improved Perception of Targeted Plant Nodes 6 hours ago | arxiv.org

abstract arxiv automate cs.cv +11

Radarize: Enhancing Radar SLAM with Generalizable Doppler-Based Odometry 6 hours ago | arxiv.org

abstract alternative arxiv challenges +17

Artificial Intelligence in Assessing Cardiovascular Diseases and Risk Factors via Retinal Fundus Images: A Review … 6 hours ago | arxiv.org

abstract analysis artificial artificial intelligence +14

BMAD: Benchmarks for Medical Anomaly Detection 6 hours ago | arxiv.org

anomaly anomaly detection arxiv benchmarks +5

Has the Virtualization of the Face Changed Facial Perception? A Study of the Impact of … 6 hours ago | arxiv.org

abstract arxiv augmented reality communication +14

Neural \'{E}tendue Expander for Ultra-Wide-Angle High-Fidelity Holographic Display 6 hours ago | arxiv.org

abstract applications arxiv augmented reality +14

Forensic Iris Image-Based Post-Mortem Interval Estimation 6 hours ago | arxiv.org

abstract application arxiv cs.cv +9

InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction 6 hours ago | arxiv.org

arxiv cs.cv cs.ro matrix +3

Amodal Ground Truth and Completion in the Wild 6 hours ago | arxiv.org

arxiv cs.cv truth type

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Data Engineer - AWS

@ 3Pillar Global | Costa Rica

View on ai-jobs.net

Cost Controller/ Data Analyst - India

@ John Cockerill | Mumbai, India, India, India

View on ai-jobs.net