Dec. 3, 2023, 1:07 p.m. | /u/graphitout

Deep Learning www.reddit.com

When I first read about attention, I assumed that the positional encoding was relative (as in, the difference between the positions of the query and the key). But according to the paper "Attention Is All You Need", the positional encoding vector appears to be fixed. The paper states that:

"We chose this function because we hypothesized it would allow the model to easily learn to attend by relative positions, since for any fixed offset k, P Epos+k can be represented as a linear …

