[D] Confusion about masking in BERT model
Web: https://www.reddit.com/r/MachineLearning/comments/se6z01/d_confusion_about_masking_in_bert_model/
Jan. 27, 2022, 7:51 p.m. | /u/mrtac96
Machine Learning reddit.com
I am trying to understand masking in the BERT model.
I am confused by the following passage from the paper:
The training data generator chooses 15% of the token positions at random for prediction. If the i-th token is chosen, we replace the i-th token with (1) the [MASK] token 80% of the time (2) a random token 10% of the time (3) the unchanged i-th token 10% of the time
In point 3 it says the token is left unchanged (I think it …
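To make the quoted rule concrete, here is a rough sketch of how the 80/10/10 replacement could be written. It is not the paper's actual data generator: the function name `mask_tokens`, the `vocab` list, and the per-position coin flip used to approximate "15% of the token positions" are all assumptions for illustration, and special tokens such as [CLS] and [SEP] are not handled.

```python
import random

MASK = "[MASK]"

def mask_tokens(tokens, vocab, select_prob=0.15, rng=None):
    """Hypothetical helper: apply the 80/10/10 rule to a list of tokens.

    Returns (inputs, labels). labels[i] is the original token at every
    chosen position and None everywhere else.
    """
    rng = rng or random.Random()
    inputs = list(tokens)
    labels = [None] * len(tokens)
    for i, tok in enumerate(tokens):
        # Assumption: each position is chosen independently with
        # probability select_prob, which only approximates "15% of positions".
        if rng.random() >= select_prob:
            continue
        labels[i] = tok  # the model must predict the original token here
        r = rng.random()
        if r < 0.8:
            inputs[i] = MASK               # (1) [MASK] 80% of the time
        elif r < 0.9:
            inputs[i] = rng.choice(vocab)  # (2) random token 10% of the time
        # (3) otherwise the input token is left unchanged (10% of the time)
    return inputs, labels

if __name__ == "__main__":
    vocab = ["my", "dog", "is", "hairy", "apple", "runs"]
    sentence = ["my", "dog", "is", "hairy"]
    print(mask_tokens(sentence, vocab, rng=random.Random(0)))
```

In this sketch, case 3 still records the original token in `labels`, so a chosen position counts toward the prediction loss even though the input at that position looks untouched.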