all AI news
The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization
March 12, 2024, 4:49 a.m. | Jiafeng Mao, Xueting Wang, Kiyoharu Aizawa
cs.CV updates on arXiv.org arxiv.org
Abstract: Text-to-image diffusion models allow users control over the content of generated images. Still, text-to-image generation occasionally leads to generation failure requiring users to generate dozens of images under the same text prompt before they obtain a satisfying result. We formulate the lottery ticket hypothesis in denoising: randomly initialized Gaussian noise images contain special pixel blocks (winning tickets) that naturally tend to be denoised into specific content independently. The generation failure in standard text-to-image synthesis is …
abstract arxiv control cs.cv denoising diffusion diffusion models failure generate generated hypothesis image image diffusion image generation images leads lottery ticket hypothesis prompt semantic text text-to-image type
More from arxiv.org / cs.CV updates on arXiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Reporting & Data Analytics Lead (Sizewell C)
@ EDF | London, GB
Data Analyst
@ Notable | San Mateo, CA