Aug. 11, 2022, 3:36 a.m. | /u/Particular-Turn35


I'm using a [pretrained BERT](https://huggingface.co/tftransformers/bert-base-cased) for the first time and found something weird: the word 'demonstrators' gets **split into 3 tokens that, on their own, have different meanings**.

```
original  = "Thousands of demonstrators"
tokenized = ["Thousands", "of", "demons", "##tra", "##tors"]
```
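For reference, here's a minimal repro sketch. I'm assuming the standard `bert-base-cased` checkpoint and the Hugging Face `transformers` API; the tftransformers repack linked above should behave the same, but I haven't confirmed it:

```python
from transformers import AutoTokenizer

# NOTE: using the plain "bert-base-cased" checkpoint here as an assumption,
# not the tftransformers repack from the link.
tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")

tokens = tokenizer.tokenize("Thousands of demonstrators")
print(tokens)
# -> ['Thousands', 'of', 'demons', '##tra', '##tors']

# The '##' pieces glue back onto the preceding token, so the original
# string is recoverable from the token sequence:
print(tokenizer.convert_tokens_to_string(tokens))
# -> 'Thousands of demonstrators'
```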

1. Will this affect the model's performance?
2. What's the function of the '##' prefix here?

