Video generation models as world simulators | allainews.com

Feb. 15, 2024, 8 a.m. |

OpenAI Blog openai.com

We explore large-scale training of generative models on video data. Specifically, we train text-conditional diffusion models jointly on videos and images of variable durations, resolutions and aspect ratios. We leverage a transformer architecture that operates on spacetime patches of video and image latent codes. Our largest model, Sora, is capable of generating a minute of high fidelity video. Our results suggest that scaling video generation models is a promising path towards building general purpose simulators of the physical world.

architecture data diffusion diffusion models explore generative generative models image images scale sora text train training transformer transformer architecture video video data video generation videos world

More from openai.com / OpenAI Blog

OpenAI’s commitment to child safety: adopting safety by design principles 5 days, 9 hours ago | openai.com

child children commitment companies +7

Introducing more enterprise-grade features for API customers 5 days, 9 hours ago | openai.com

api assistants costs customers +6

Introducing OpenAI Japan 2 weeks ago | openai.com

asia gpt gpt-4 japan +4

Introducing improvements to the fine-tuning API and expanding our custom models program 3 weeks, 3 days ago | openai.com

api build control custom models +6

Start using ChatGPT instantly 3 weeks, 6 days ago | openai.com

benefits benefits of ai chatgpt experience +2

Navigating the Challenges and Opportunities of Synthetic Voices 4 weeks, 2 days ago | openai.com

challenges opportunities scale small +4

Sora: First Impressions 1 month ago | openai.com

community creative feedback impressions +1

Global news partnerships: Le Monde and Prisa Media 1 month, 2 weeks ago | openai.com

chatgpt french global international +4

Review completed & Altman, Brockman to continue to lead OpenAI 1 month, 2 weeks ago | openai.com

altman board brockman governance +4

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Research Scientist, Demography and Survey Science, University Grad

@ Meta | Menlo Park, CA | New York City

View on ai-jobs.net

Computer Vision Engineer, XR

@ Meta | Burlingame, CA

View on ai-jobs.net