[D] Confusion about masking in BERT model | allainews.com

Jan. 27, 2022, 7:51 p.m. | /u/mrtac96

Machine Learning www.reddit.com

I am trying to understand the masking in BERT model.

I have confusion in following line taken from paper

The training data generator chooses 15% of the token positions at random for prediction. If the i-th token is chosen, we replace the i-th token with (1) the [MASK] token 80% of the time (2) a random token 10% of the time (3) the unchanged i-th token 10% of the time

at point 3 it say unchanged token (i think it …

bert machinelearning

More from www.reddit.com / Machine Learning

[Discussion] Are there specific technical/scientific breakthroughs that have allowed the significant jump in maximum context … 7 hours ago | www.reddit.com

claude context gpt gpt-4 +14

[D] How to evaluate RAG - both retrieval and generation, when all I have is … 9 hours ago | www.reddit.com

data documents embedding embedding models +7

[D] Has anyone tried distilling large language models the old way? 13 hours ago | www.reddit.com

distillation however language language model +9

[D] Llama-3 (7B and 70B) on a medical domain benchmark 19 hours ago | www.reddit.com

70b ai community benchmark community +10

[D] Data Scientist: job preparation guide 2024 19 hours ago | www.reddit.com

data data scientist genai guide +7

[D] ICML Meta Reviews 20 hours ago | www.reddit.com

machinelearning

[R] Show Your Work with Confidence: Confidence Bands for Tuning Curves 21 hours ago | www.reddit.com

abstract accounting function hyperparameter +11

[R] InternVL v1.5 open sourced, ranking first in OpenCompass multi-modal benchmark 21 hours ago | www.reddit.com

benchmark cvpr demo download +7

[N] Meta releases Llama 3 21 hours ago | www.reddit.com

machinelearning

(373) Applications Manager – Business Intelligence - BSTD

@ South African Reserve Bank | South Africa

View on ai-jobs.net

Data Engineer Talend (confirmé/sénior) - H/F - CDI

@ Talan | Paris, France

View on ai-jobs.net

Data Science Intern (Summer) / Stagiaire en données (été)

@ BetterSleep | Montreal, Quebec, Canada

View on ai-jobs.net

Director - Master Data Management (REMOTE)

@ Wesco | Pittsburgh, PA, United States

View on ai-jobs.net

Architect Systems BigData REF2649A

@ Deutsche Telekom IT Solutions | Budapest, Hungary

View on ai-jobs.net

Data Product Coordinator

@ Nestlé | São Paulo, São Paulo, BR, 04730-000

View on ai-jobs.net