April 14, 2024, 3:07 a.m. | /u/Jazzlike-Common-8978

Deep Learning www.reddit.com

Hi guys, I want to ask about the conditioning mechanism in DiT (Diffusion Transformer). It uses AdaLN, which applies scale-and-shift operators, and the authors report that it works better than cross-attention. However, I think AdaLN should be worse than cross-attention, because it only allows the conditioning information to be a single vector, which limits how much information it can carry.

Am I correct?
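To make clear what I mean, here's a minimal sketch of AdaLN conditioning (my own simplified PyTorch version, not the paper's code; the actual DiT block regresses separate shift/scale/gate parameters for the attention and MLP sublayers, i.e. 6×dim, and zero-initializes the final linear layer in the adaLN-Zero variant):

```python
import torch
import torch.nn as nn

class AdaLNBlock(nn.Module):
    """Simplified adaLN(-Zero)-style modulation: a single conditioning
    vector c (e.g. timestep + class embedding) is regressed into
    per-sample shift, scale, and gate vectors."""
    def __init__(self, dim: int):
        super().__init__()
        # LayerNorm without learned affine params; they come from c instead.
        self.norm = nn.LayerNorm(dim, elementwise_affine=False)
        # One MLP maps c -> (shift, scale, gate). adaLN-Zero would
        # zero-initialize the Linear so each block starts as identity.
        self.to_mod = nn.Sequential(nn.SiLU(), nn.Linear(dim, 3 * dim))

    def forward(self, x: torch.Tensor, c: torch.Tensor) -> torch.Tensor:
        # x: (batch, tokens, dim); c: (batch, dim)
        shift, scale, gate = self.to_mod(c).chunk(3, dim=-1)
        # Broadcast the per-sample vectors over the token axis:
        # every token gets the same modulation.
        h = self.norm(x) * (1 + scale.unsqueeze(1)) + shift.unsqueeze(1)
        return x + gate.unsqueeze(1) * h  # gated residual

x = torch.randn(2, 16, 128)   # 16 tokens of width 128
c = torch.randn(2, 128)       # one conditioning vector per sample
out = AdaLNBlock(128)(x, c)
```

The point I'm getting at: `c` is a single `(batch, dim)` vector broadcast over all tokens, whereas cross-attention lets each token attend over a whole sequence of conditioning tokens.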

[attached image: https://preview.redd.it/ljyh2xop1duc1.png?width=1181&format=png&auto=webp&s=78ba16dfd3a44e894a3714719cac9e9cd3d732a8]

