Feb. 26, 2024, 1 p.m. | /u/mono1110

Deep Learning www.reddit.com

Quick context: I am trying to solve a very simple problem with self-attention.

Here are the two approaches I tried (a minimal sketch of both follows the list).

1. The training data was one-hot encoded, then positional encoding was added. The result was passed through self-attention followed by two feedforward layers.

2. The training data was label encoded and passed through an embedding layer, then positional encoding was added. The result was passed through self-attention followed by two feedforward layers.
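Since the post only names the building blocks, here is a minimal PyTorch sketch of the two pipelines as I read them. The vocabulary size, sequence length, model width, the sinusoidal positional encodings, and the use of `nn.MultiheadAttention` with a single head are all assumptions, not details from the post.

```python
# Minimal sketch of the two pipelines. All sizes and the choice of
# sinusoidal positional encodings are assumptions; the post does not
# specify them.
import math
import torch
import torch.nn as nn
import torch.nn.functional as F

VOCAB, SEQ_LEN, D_MODEL = 16, 10, 32  # toy sizes (assumed)

def sinusoidal_pe(seq_len: int, d_model: int) -> torch.Tensor:
    """Standard sinusoidal positional encodings, shape (seq_len, d_model)."""
    pos = torch.arange(seq_len).unsqueeze(1).float()
    div = torch.exp(torch.arange(0, d_model, 2).float()
                    * (-math.log(10000.0) / d_model))
    pe = torch.zeros(seq_len, d_model)
    pe[:, 0::2] = torch.sin(pos * div)
    pe[:, 1::2] = torch.cos(pos * div)
    return pe

class AttnBlock(nn.Module):
    """Self-attention followed by two feedforward layers, as described in the post."""
    def __init__(self, d_model: int, n_out: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, num_heads=1, batch_first=True)
        self.ff1 = nn.Linear(d_model, d_model)
        self.ff2 = nn.Linear(d_model, n_out)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x, _ = self.attn(x, x, x)  # self-attention: Q = K = V = x
        return self.ff2(torch.relu(self.ff1(x)))

class OneHotModel(nn.Module):
    """Approach 1: one-hot inputs with positional encoding added directly."""
    def __init__(self):
        super().__init__()
        self.register_buffer("pe", sinusoidal_pe(SEQ_LEN, VOCAB))
        self.block = AttnBlock(VOCAB, VOCAB)  # attention runs at vocab width

    def forward(self, tokens: torch.Tensor) -> torch.Tensor:
        x = F.one_hot(tokens, VOCAB).float()  # (batch, seq, vocab)
        return self.block(x + self.pe)

class EmbeddingModel(nn.Module):
    """Approach 2: label-encoded inputs through an embedding layer, then positional encoding."""
    def __init__(self):
        super().__init__()
        self.emb = nn.Embedding(VOCAB, D_MODEL)
        self.register_buffer("pe", sinusoidal_pe(SEQ_LEN, D_MODEL))
        self.block = AttnBlock(D_MODEL, VOCAB)

    def forward(self, tokens: torch.Tensor) -> torch.Tensor:
        return self.block(self.emb(tokens) + self.pe)

tokens = torch.randint(0, VOCAB, (4, SEQ_LEN))  # toy batch of label-encoded sequences
print(OneHotModel()(tokens).shape, EmbeddingModel()(tokens).shape)
```

Note that approach 2's embedding layer is a learned lookup, mathematically equivalent to multiplying approach 1's one-hot vector by a trainable weight matrix, so the main practical differences are that extra learned projection and the dimensionality at which attention operates.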

Now the results.
In the first approach, the training accuracy …
