Nov. 2, 2022, 9:54 p.m. | /u/cloneofsimo

Machine Learning www.reddit.com

Hi. Today I've came across this interesting paper [https://arxiv.org/abs/2210.16056](https://arxiv.org/abs/2210.16056) that proposes interesting method to combine semantics of text and image in diffusion process.

In short, this mixes "layout" with "content", however unlike style transfer,


>"...semantic mixing aims to fuse multiple semantics into one single object."

I was surprised by the examples they showed, so I wanted to try it but the code wasn't available. I've implemented the method myself, and I wanted to share it here!

[https://github.com/cloneofsimo/magicmix](https://github.com/cloneofsimo/magicmix)

[ Layout of …

bytedance diffusion implementation machinelearning natural researchers stable diffusion

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Vice President, Data Science, Marketplace

@ Xometry | North Bethesda, Maryland, Lexington, KY, Remote

Field Solutions Developer IV, Generative AI, Google Cloud

@ Google | Toronto, ON, Canada; Atlanta, GA, USA