Nov. 15, 2023, 1:23 p.m. | /u/duffano

Deep Learning www.reddit.com

Dear all,

I had a look at the encoder-decoder architecture following the seminal paper "Attention is all you need".

After running my own experiments and doing further reading, I found many sources saying that the (maximum) input lengths of the encoder and decoder are usually the same, or that there is no practical reason to use different lengths (see e.g. [https://stats.stackexchange.com/questions/603535/in-transformers-for-the-maximum-length-of-encoders-input-sequences-and-decoder](https://stats.stackexchange.com/questions/603535/in-transformers-for-the-maximum-length-of-encoders-input-sequences-and-decoder)).

What puzzles me is the "usually". I want to understand this at the mathematical level, and I …
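As a rough illustration of why "usually" is a convention rather than a mathematical requirement, here is a minimal sketch (assuming PyTorch and its built-in `nn.Transformer`): attention itself is length-agnostic, and only the position embeddings fix a maximum, so the encoder and decoder maxima can be chosen independently. The names `MAX_SRC_LEN` and `MAX_TGT_LEN` are illustrative choices, not anything from the paper.

```python
import torch
import torch.nn as nn

D_MODEL = 64
MAX_SRC_LEN = 512   # hypothetical encoder maximum
MAX_TGT_LEN = 128   # hypothetical decoder maximum, deliberately different

# Learned position embeddings sized independently for each side.
src_pos = nn.Embedding(MAX_SRC_LEN, D_MODEL)
tgt_pos = nn.Embedding(MAX_TGT_LEN, D_MODEL)

model = nn.Transformer(d_model=D_MODEL, nhead=4,
                       num_encoder_layers=2, num_decoder_layers=2,
                       batch_first=True)

# A 300-token source and a 50-token target, each within its own maximum.
src = torch.randn(1, 300, D_MODEL) + src_pos(torch.arange(300)).unsqueeze(0)
tgt = torch.randn(1, 50, D_MODEL) + tgt_pos(torch.arange(50)).unsqueeze(0)

# Cross-attention handles src_len != tgt_len with no special treatment:
# queries come from the decoder (50 positions), keys/values from the encoder (300).
out = model(src, tgt)
print(out.shape)  # torch.Size([1, 50, 64])
```

In other words, equal maxima are a common engineering default (shared tooling, padding, and batching), not something the attention equations demand.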

