Nov. 6, 2022, 8:42 p.m. | /u/cloneofsimo

Machine Learning www.reddit.com

Hi. Researchers from NVIDIA recently released their work on text-to-image diffusion models, eDiffi ([https://deepimagination.cc/eDiffi/](https://deepimagination.cc/eDiffi/)). In their paper they propose several methods, including paint-with-words.

Paint-with-words lets you generate an image from an arbitrary text-labeled segmentation map. Check out their paper for more details on the method.

Unfortunately, their code and the eDiffi models are not available. However, Stable Diffusion can do the same thing, since both models have cross-attention modules.
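
To make the cross-attention idea concrete, here is a rough NumPy sketch. It is not the eDiffi code and not the Stable Diffusion implementation, just a toy under stated assumptions: the segmentation map is turned into a pixel-by-token matrix (`region_mask`, a hypothetical name) that is added to the attention logits with a scalar `weight`, so pixels inside a labeled region attend more to that label's token. How the weight is chosen or scheduled is left out here; see the paper for that.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def biased_cross_attention(q, k, v, region_mask, weight):
    """Cross attention with an additive, segmentation-derived bias.

    q:           (n_pixels, d)   image-side queries
    k:           (n_tokens, d)   text-side keys
    v:           (n_tokens, d_v) text-side values
    region_mask: (n_pixels, n_tokens), 1 where a pixel lies in the
                 region labeled with that token, 0 elsewhere
                 (hypothetical layout, chosen for illustration)
    weight:      scalar strength of the bias (assumed constant here)
    """
    d = q.shape[-1]
    scores = (q @ k.T + weight * region_mask) / np.sqrt(d)
    probs = softmax(scores, axis=-1)
    return probs @ v, probs

rng = np.random.default_rng(0)
n_pixels, n_tokens, d = 4, 3, 8
q = rng.normal(size=(n_pixels, d))
k = rng.normal(size=(n_tokens, d))
v = rng.normal(size=(n_tokens, d))

# Toy "segmentation map": the first two pixels are labeled with token 1.
region_mask = np.zeros((n_pixels, n_tokens))
region_mask[:2, 1] = 1.0

out_plain, plain = biased_cross_attention(q, k, v, region_mask, weight=0.0)
out_biased, biased = biased_cross_attention(q, k, v, region_mask, weight=5.0)

# Attention paid to token 1 by the labeled pixels, before and after.
print(plain[:2, 1], biased[:2, 1])
```

With `weight=0.0` this reduces to ordinary cross attention. With a positive weight, only the rows for the labeled pixels change, which is why the same trick can in principle be applied to any model that exposes its cross-attention layers.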

I tried to make it work with Stable Diffusion, and it worked! …
