Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation | allainews.com

April 9, 2024, 4:48 a.m. | Wenjing Wang, Huan Yang, Zixi Tuo, Huiguo He, Junchen Zhu, Jianlong Fu, Jiaying Liu

cs.CV updates on arXiv.org arxiv.org

arXiv:2305.10874v3 Announce Type: replace
Abstract: With the explosive popularity of AI-generated content (AIGC), video generation has recently received a lot of attention. Generating videos guided by text instructions poses significant challenges, such as modeling the complex relationship between space and time, and the lack of large-scale text-video paired data. Existing text-video datasets suffer from limitations in both content quality and scale, or they are not open-source, rendering them inaccessible for study and use. For model design, previous approaches extend pretrained …

abstract aigc ai-generated content arxiv attention challenges cs.cv data datasets generated modeling relationship scale space space and time text text-to-video type video video generation videos

More from arxiv.org / cs.CV updates on arXiv.org

Physics-Informed Computer Vision: A Review and Perspectives 54 minutes ago | arxiv.org

abstract application arxiv computer +26

Boosting Visual Recognition in Real-world Degradations via Unsupervised Feature Enhancement Module with Deep Channel Prior 54 minutes ago | arxiv.org

arxiv boosting cs.cv feature +8

Analyzing and Mitigating Bias for Vulnerable Classes: Towards Balanced Representation in Dataset 54 minutes ago | arxiv.org

abstract accuracy arxiv autonomous +23

GPT4Ego: Unleashing the Potential of Pre-trained Models for Zero-Shot Egocentric Action Recognition 54 minutes ago | arxiv.org

abstract action recognition advancement arxiv +23

Revisiting Sampson Approximations for Geometric Estimation Problems 54 minutes ago | arxiv.org

abstract arxiv collection computer +8

Frequency-Time Diffusion with Neural Cellular Automata 54 minutes ago | arxiv.org

abstract arxiv capabilities cellular +16

A Comprehensive Overview of Fish-Eye Camera Distortion Correction Methods 54 minutes ago | arxiv.org

abstract applications arxiv cameras +13

Adaptive Depth Networks with Skippable Sub-Paths 54 minutes ago | arxiv.org

abstract arxiv control cs.ai +11

Attention-aware Social Graph Transformer Networks for Stochastic Trajectory Prediction 54 minutes ago | arxiv.org

abstract arxiv attention autonomous +26

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net