April 2, 2024, 7:48 p.m. | Siddhant Jain, Daniel Watson, Eric Tabellion, Aleksander Ho{\l}y\'nski, Ben Poole, Janne Kontkanen

cs.CV updates on arXiv.org arxiv.org

arXiv:2404.01203v1 Announce Type: new
Abstract: We present VIDIM, a generative model for video interpolation, which creates short videos given a start and end frame. In order to achieve high fidelity and generate motions unseen in the input data, VIDIM uses cascaded diffusion models to first generate the target video at low resolution, and then generate the high-resolution video conditioned on the low-resolution generated video. We compare VIDIM to previous state-of-the-art methods on video interpolation, and demonstrate how such works fail …

abstract arxiv cs.cv data diffusion diffusion models fidelity generate generative low resolution type video videos

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Principal, Product Strategy Operations, Cloud Data Analytics

@ Google | Sunnyvale, CA, USA; Austin, TX, USA

Data Scientist - HR BU

@ ServiceNow | Hyderabad, India