Why do transformers use "fixed" position encoding?
Dec. 3, 2023, 1:07 p.m. | /u/graphitout
Deep Learning | www.reddit.com
"We chose this function because we hypothesized it would allow the model to easily learn to attend by relative positions, since for any fixed offset k, P Epos+k can be represented as a linear …
Tags: attention, attention is all you need, deeplearning, difference, encoding, function, paper, per, positional encoding, query, reading, transformers, vector
Jobs in AI, ML, Big Data
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne