all AI news
Google Research: Lumiere
Simon Willison's Weblog simonwillison.net
The latest in text-to-video from Google Research, described as "a text-to-video diffusion model designed for synthesizing videos that portray realistic, diverse and coherent motion".
Most existing text-to-video models generate keyframes and then use other models to fill in the gaps, which frequently leads to a lack of coherency. Lumiere "generates the full temporal duration of the video at once", which avoids this problem.
Disappointingly but unsurprisingly the paper doesn't go into much detail on the training data, …
ai diffusion diffusion model diverse generate generativeai google google research leads research temporal text text-to-video video video diffusion videos