Feb. 1, 2024, 10:10 p.m. | /u/adeeplearner


Hello,

I'm reading this tutorial on positional encoding in the transformer architecture: [https://machinelearningmastery.com/a-gentle-introduction-to-positional-encoding-in-transformer-models-part-1/](https://machinelearningmastery.com/a-gentle-introduction-to-positional-encoding-in-transformer-models-part-1/)

I don't understand the very last part of it:



>What Is the Final Output of the Positional Encoding Layer?
>
>The positional encoding layer sums the positional vector with the word encoding and outputs this matrix for the subsequent layers. The entire process is shown below.



Basically it says the token (word) embedding should be added to the positional embedding. What's the justification for that? …
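
To make sure I'm reading that step right, here is a rough NumPy sketch of what I think the layer does: build the sinusoidal encoding from the article's formula and add it element-wise to the token embeddings. The token embeddings here are just random stand-ins, and the helper function name is my own:

```python
import numpy as np

def sinusoidal_positional_encoding(seq_len, d_model, n=10000):
    """Sinusoidal encoding from the tutorial:
    PE(pos, 2i)   = sin(pos / n^(2i / d_model))
    PE(pos, 2i+1) = cos(pos / n^(2i / d_model))
    """
    pe = np.zeros((seq_len, d_model))
    positions = np.arange(seq_len)[:, None]        # (seq_len, 1)
    i = np.arange(d_model // 2)[None, :]           # (1, d_model/2)
    angles = positions / n ** (2 * i / d_model)    # (seq_len, d_model/2)
    pe[:, 0::2] = np.sin(angles)                   # even dimensions
    pe[:, 1::2] = np.cos(angles)                   # odd dimensions
    return pe

seq_len, d_model = 4, 8

# Stand-in for the learned token (word) embeddings, random for illustration
token_embeddings = np.random.randn(seq_len, d_model)
pos_encoding = sinusoidal_positional_encoding(seq_len, d_model)

# The "final output" of the positional encoding layer: an element-wise sum
layer_output = token_embeddings + pos_encoding
print(layer_output.shape)  # (4, 8) -- same shape as the embeddings
```

So the two matrices are simply summed, and the result is passed to the next layer. My question is why adding them (rather than, say, concatenating) is the right thing to do.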

