RoBERTa Bytepiece tokenizer - extracting rep positions from sequences. | allainews.com

May 1, 2022, 11:23 p.m. | /u/PlumOutrageous5625

Natural Language Processing www.reddit.com

So I'm trying to extract the RoBERTa representations for particular words in a sentence, for example:

"The cat in the **hat** went to the pond"

Lets say I'm interested in **hat**. I noticed that after tokenizing, if I tokenize "hat" in isolation, its token ID for hat is different from when I tokenize "the cat in the hat went to the pond"...

Essentially, I'm trying to do a study on contextualized word-reps, but if the rep for **hat** is different …

languagetechnology roberta

More from www.reddit.com / Natural Language Processing

Which NLP-master programs in Europe are more cs-leaning? 11 hours ago | www.reddit.com

computational english europe germany +12

What do you think is the state of the art technique for matching a piece … 2 days, 9 hours ago | www.reddit.com

art city database example +9

Multilabel text classification on unlabled data 2 days, 22 hours ago | www.reddit.com

classification data finance isn +11

I made a text-game where all the LLMs trick each other pretending to be humans. … 3 days, 14 hours ago | www.reddit.com

game humans languagetechnology llms +3

Help with fraud recognition 3 days, 20 hours ago | www.reddit.com

bank code country detection +7

AI-proof language-related jobs in the United States? 5 days, 3 hours ago | www.reddit.com

jobs language languagetechnology management +4

Leveling up RAG 5 days, 12 hours ago | www.reddit.com

advanced advice cleaning context +8

Did we just receive an AI-generated meta-review? 1 week ago | www.reddit.com

generated languagetechnology meta review

Found a Way to Keep Transcripts Going 24/7 1 week, 1 day ago | www.reddit.com

apple apple silicon bugs check +10

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Lead Data Modeler

@ Sherwin-Williams | Cleveland, OH, United States

View on ai-jobs.net