June 2, 2023, 9:13 a.m. | /u/OB2211

Machine Learning www.reddit.com

[https://arxiv.org/pdf/2004.12832.pdf](https://arxiv.org/pdf/2004.12832.pdf)

Section 3.2

In the paper they propose to use \[mask\] token as padding tokens for a query, that serves as augmentation.

"We denote the padding with masked tokens as query augmentation, a step that allows BERT to produce query-based embeddings at the positions corresponding to these masks. query augmentation is intended to serve as a soft, diferentiable mechanism for learning to expand queries with new terms or to re-weigh existing terms based on their importance for matching the query." …

augmentation bert embeddings machinelearning masks padding paper query serve tokens

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York