Jan. 16, 2024, 2 p.m. | Anthony Alford

InfoQ - AI, ML & Data Engineering www.infoq.com

Google Research recently published their work on VideoPoet, a large language model (LLM) that can generate video. VideoPoet was trained on 2 trillion tokens of text, audio, image, and video data, and in evaluations by human judges its output was preferred over that of other models.

By Anthony Alford

ai anthony audio data deep learning generate generative-ai google google research human image judges language language model large language large language model large language models llm ml & data engineering neural networks research text tokens video video data video generation videopoet work

More from www.infoq.com / InfoQ - AI, ML & Data Engineering

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne