ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation | allainews.com

May 24, 2024, 4:52 a.m. | Bo Peng, Xinyuan Chen, Yaohui Wang, Chaochao Lu, Yu Qiao

cs.CV updates on arXiv.org arxiv.org

arXiv:2310.07697v2 Announce Type: replace
Abstract: Recent works have successfully extended large-scale text-to-image models to the video domain, producing promising results but at a high computational cost and requiring a large amount of video data. In this work, we introduce ConditionVideo, a training-free approach to text-to-video generation based on the provided condition, video, and input text, by leveraging the power of off-the-shelf text-to-image generation methods (e.g., Stable Diffusion). ConditionVideo generates realistic dynamic videos from random noise or given scene videos. Our …

abstract arxiv computational cost cs.cv data domain free image replace results scale text text-to-image text-to-video training type video video data video generation work

More from arxiv.org / cs.CV updates on arXiv.org

DIAS: A Dataset and Benchmark for Intracranial Artery Segmentation in DSA sequences 2 days, 12 hours ago | arxiv.org

arxiv benchmark cs.cv dataset +6

Benchmarking Pretrained Vision Embeddings for Near- and Duplicate Detection in Medical Images 2 days, 12 hours ago | arxiv.org

abstract arxiv benchmarking biases +20

MAFA: Managing False Negatives for Vision-Language Pre-training 2 days, 12 hours ago | arxiv.org

arxiv cs.ai cs.cv false +7

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation 2 days, 12 hours ago | arxiv.org

abstract animate anyone animation arxiv +23

KNVQA: A Benchmark for evaluation knowledge-based VQA 2 days, 12 hours ago | arxiv.org

abstract accuracy arxiv benchmark +22

Optimization Efficient Open-World Visual Region Recognition 2 days, 12 hours ago | arxiv.org

abstract arxiv building capabilities +25

HyperFields: Towards Zero-Shot Generation of NeRFs from Text 2 days, 12 hours ago | arxiv.org

abstract arxiv cs.cv distillation +14

Multi-modal Learning with Missing Modality via Shared-Specific Feature Modelling 2 days, 12 hours ago | arxiv.org

arxiv cs.cv feature modal +5

A Generative Model for Digital Camera Noise Synthesis 2 days, 12 hours ago | arxiv.org

abstract arxiv cs.cv digital +14

Senior Data Engineer

@ Displate | Warsaw

View on ai-jobs.net

Junior Data Analyst - ESG Data

@ Institutional Shareholder Services | Mumbai

View on ai-jobs.net

Intern Data Driven Development in Sensor Fusion for Autonomous Driving (f/m/x)

@ BMW Group | Munich, DE

View on ai-jobs.net

Senior MLOps Engineer, Machine Learning Platform

@ GetYourGuide | Berlin

View on ai-jobs.net

Data Engineer, Analytics

@ Meta | Menlo Park, CA

View on ai-jobs.net

Data Engineer

@ Meta | Menlo Park, CA

View on ai-jobs.net