April 20, 2024, 3:13 p.m. | Thomas Claburn

The Register - Software: AI + ML www.theregister.com

VASA-1 framework can turn a still image and a cloned voice file into a plausible video of a person talking

Microsoft this week demoed VASA–1, a framework for creating videos of people talking from a still image, audio sample, and text script, and claims – rightly – it's too dangerous to be released to the public.…

audio deepfake file framework good image microsoft people person release sample text vasa vasa-1 video videos voice

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Lead Data Modeler

@ Sherwin-Williams | Cleveland, OH, United States