July 19, 2023, 8:56 p.m. | /u/Captain_Flashheart

Natural Language Processing | www.reddit.com

A data scientist on our team is curious what would happen if we used subword tokenization (BERT's WordPiece tokenization) as the tokenization step for our conventional models (word2vec, CNNs, LSTMs). The word2vec model is used for recommendation and clustering, in addition to serving "just" as the embedding layers of other models. We said we'd try it out.
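For concreteness, here is a minimal sketch of what the experiment might look like, assuming the Hugging Face `transformers` tokenizer and `gensim`; the corpus, checkpoint name, and hyperparameters below are illustrative placeholders, not our actual setup:

```python
# Sketch: swap word-level tokenization for BERT's WordPiece subword
# tokenization, then train word2vec over the subword tokens.
import numpy as np
from transformers import AutoTokenizer
from gensim.models import Word2Vec

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

# Placeholder corpus; in practice this would be our domain text.
corpus = [
    "the patient was prescribed aspirin for chest pain",
    "we recommend similar items based on purchase history",
]

# WordPiece may split rare words into pieces, e.g. 'as', '##pir', '##in'.
subword_sentences = [tokenizer.tokenize(sentence) for sentence in corpus]

# word2vec now learns one vector per subword piece, not per word.
model = Word2Vec(sentences=subword_sentences, vector_size=100, window=5, min_count=1)

# A whole word no longer has a single vector; one common workaround is
# to pool (here: average) the vectors of its pieces.
pieces = tokenizer.tokenize("aspirin")
word_vector = np.mean([model.wv[piece] for piece in pieces], axis=0)
```

Downstream uses like clustering and recommendation would then operate on pooled vectors like `word_vector`, which is exactly where the behavior could change.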

My own intuition is that it would decrease the quality of the word2vec model, since we want this model specifically to distinguish between things …
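One way to see that concern concretely: two distinct domain terms that WordPiece splits into overlapping pieces end up sharing parameters, so vectors composed from their pieces get pulled together. A hedged illustration, with hypothetical stand-in words:

```python
# Illustration of the intuition: if distinct words share subword pieces,
# any embedding pooled from piece vectors shares those components too.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

for word in ["methotrexate", "methionine"]:
    print(word, "->", tokenizer.tokenize(word))

# If both words open with the same piece, a pooled embedding shares that
# component across both, blurring a distinction that a whole-word
# word2vec vocabulary would keep as two unrelated entries.
```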

