Aug. 10, 2022, 5:58 p.m. | /u/Silly_Ad_4008

Natural Language Processing www.reddit.com

Since an embedding in PyTorch acts as a lookup table, is there any difference between these two code snippets?

(model.shared refers to the embedding layer of a T5 transformer)

model.shared = new_emb
model.lm_head = new_head

and

model.shared.weight = new_emb.weight
model.lm_head.weight = new_head.weight

The reason I am asking is that the two versions give me different loss values (cross-validation loss):

Loss for code piece 1: [https://i.stack.imgur.com/D0sz7.png](https://i.stack.imgur.com/D0sz7.png)

Loss for code piece 2: [https://i.stack.imgur.com/FvhMW.png](https://i.stack.imgur.com/FvhMW.png)
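One place the two snippets can genuinely diverge is weight tying: T5 ties lm_head.weight to shared.weight by default, and module assignment versus Parameter assignment interact differently with what survives the swap. A minimal plain-PyTorch sketch of the difference, where TinyModel is a hypothetical stand-in for the tied shared/lm_head pair (not the real T5 class):

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical stand-in for T5's tied embedding / LM head:
# lm_head.weight and shared.weight are the same Parameter.
class TinyModel(nn.Module):
    def __init__(self, vocab=10, dim=4):
        super().__init__()
        self.shared = nn.Embedding(vocab, dim)
        self.lm_head = nn.Linear(dim, vocab, bias=False)
        self.lm_head.weight = self.shared.weight  # weight tying

new_emb = nn.Embedding(10, 4)
new_head = nn.Linear(4, 10, bias=False)

# Code piece 1: replace the whole modules.
m1 = TinyModel()
m1.shared = new_emb
m1.lm_head = new_head
print(m1.shared is new_emb)                   # True: module object swapped out
print(m1.shared.weight is m1.lm_head.weight)  # False: the tie is broken

# Code piece 2: replace only the weight Parameters.
m2 = TinyModel()
m2.shared.weight = new_emb.weight
m2.lm_head.weight = new_head.weight
print(m2.shared is new_emb)                   # False: original module object kept
print(m2.shared.weight is m2.lm_head.weight)  # False: the tie is broken here too
```

Either way the tie between the two weights is gone after the swap, but only code piece 2 keeps the original module objects (and any hooks or references to them). Also, an optimizer built before the swap still holds the old Parameters in both cases. With transformers models, the supported route is model.resize_token_embeddings(n) or model.set_input_embeddings(new_emb), which go through tie_weights() and re-tie lm_head to the new embedding; raw attribute assignment skips that step, which is a plausible cause of the different losses.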

Tags: embedding, languagetechnology, pytorch, transformers
