June 15, 2024, 4:16 a.m. | /u/Fit_Entrepreneur_588

Machine Learning www.reddit.com

The classifier-free guidance (CFG) is widely used for text-guidance in diffusion models, but notorious for its challenges, such as difficulty in DDIM inversion and ambiguity in selecting a large guidance scale.

This paper demonstrates that these limitations of CFG stem from inherent design flaws in the original CFG, and introduce CFG++, a simple yet powerful fix in the \**re\**nosing process. This adjustment facilitates smaller guidance scales, significantly improved invertibility, and much better alignment between images and text.

Project page: [https://cfgpp-diffusion.github.io/](https://cfgpp-diffusion.github.io/) …

challenges classifier design diffusion diffusion models fix flaws free guidance limitations machinelearning paper scale simple stem text

Senior Data Engineer

@ Displate | Warsaw

Engineer III, Back-End Server (mult.)

@ Samsung Electronics | 645 Clyde Avenue, Mountain View, CA, USA

Senior Product Security Engineer - Cyber Security Researcher

@ Boeing | USA - Arlington, VA

Senior Manager, Software Engineering, DevOps

@ Capital One | Richmond, VA

PGIM Quantitative Solutions, Investment Multi-Asset Research (Hybrid)

@ Prudential Financial | Prudential Tower, 655 Broad Street, Newark, NJ

Cyber Security Engineer

@ HP | FTC02 - Fort Collins, CO East Link (FTC02)