March 30, 2022, 1:22 p.m. | /u/jamescalam

Natural Language Processing www.reddit.com

Hi all, I wanted to share an article I wrote covering [Generative Pseudo-Labeling (GPL)](https://www.pinecone.io/learn/gpl/). It's a really cool technique from Kexin Wang and co (N Reimers' UKPLab) that allows us to adapt sentence transformer models to new domains using nothing more than unlabeled text data. It works by generating synthetic (query, positive, negative) triplets and margin scores, which are then used with margin MSE loss to fine-tune the sentence transformer.

Very interesting technique, if you're working in the space I'd …

labeling languagetechnology learning search semantic unsupervised unsupervised learning

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Management Associate

@ EcoVadis | Ebène, Mauritius

Senior Data Engineer

@ Telstra | Telstra ICC Bengaluru