RoBERTa byte-level BPE tokenizer - extracting rep positions from sequences.
May 1, 2022, 11:23 p.m. | /u/PlumOutrageous5625
Natural Language Processing www.reddit.com
"The cat in the **hat** went to the pond"
Let's say I'm interested in **hat**. I noticed that when I tokenize "hat" in isolation, its token ID is different from the one it gets when I tokenize "the cat in the hat went to the pond"...
Essentially, I'm trying to do a study on contextualized word-reps, but if the rep for **hat** is different …
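For anyone hitting the same thing: RoBERTa's byte-level BPE folds the preceding space into the token (shown as `Ġ`), so "hat" at the start of a string and " hat" mid-sentence are different vocabulary entries. One way around it is to locate the word by character offsets instead of by token ID. Here is a minimal toy sketch of that idea. It is not the real RoBERTa tokenizer: `toy_tokenize` and `token_positions` are hypothetical helpers that only split on whitespace and mimic the `Ġ` convention, with no subword splitting and no special tokens.

```python
import re


def toy_tokenize(text):
    """Toy stand-in for a byte-level BPE tokenizer (assumption: whitespace split only).

    Returns a list of (token, start_char, end_char) tuples, where a word
    preceded by a space gets a leading 'Ġ', mimicking RoBERTa's convention.
    """
    tokens = []
    for match in re.finditer(r"\S+", text):
        start, end = match.span()
        preceded_by_space = start > 0 and text[start - 1] == " "
        token = ("Ġ" if preceded_by_space else "") + match.group()
        tokens.append((token, start, end))
    return tokens


def token_positions(text, word):
    """Return indices of tokens whose character span overlaps the first
    whole-word occurrence of `word` in `text` (empty list if not found)."""
    found = re.search(r"\b" + re.escape(word) + r"\b", text)
    if found is None:
        return []
    word_start, word_end = found.span()
    return [
        i
        for i, (_, start, end) in enumerate(toy_tokenize(text))
        if start < word_end and end > word_start
    ]


sentence = "The cat in the hat went to the pond"
isolated = toy_tokenize("hat")
in_context = toy_tokenize(sentence)
positions = token_positions(sentence, "hat")

print(isolated[0][0])                # token for "hat" alone
print(in_context[positions[0]][0])   # token for "hat" inside the sentence
print(positions)                     # token index/indices to read the rep from
```

With a real Hugging Face fast tokenizer, the analogous route would be to request offsets (`return_offsets_mapping=True`) or call `char_to_token` on the encoding, then index the model's hidden states at those positions. Keep in mind that the real tokenizer prepends `<s>` and may split a word into several pieces, so you may get more than one position per word and need to pool them.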