May 11, 2022, 3:45 p.m. | /u/jamescalam

Natural Language Processing www.reddit.com

Hi all, I've been learning about BERTopic recently, an incredibly simple-to-use library that lets us do some seriously cool stuff in topic modeling. I dived into the details in [this article](https://www.pinecone.io/learn/bertopic/). In short:

* Uses transformer models to generate *meaningful* vector representations of text
* Fascinating techniques like UMAP (dimensionality reduction) and HDBSCAN (density-based clustering) are used to produce clusters from these vector representations
* A modified TF-IDF (called c-TF-IDF) finds the most relevant keywords for each cluster and assigns these …
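
To make the c-TF-IDF step a bit more concrete, here is a minimal pure-Python sketch. It assumes the weighting usually documented for BERTopic, where a term's normalized frequency within a cluster is multiplied by `log(1 + A / f_t)`, with `A` the average number of words per cluster and `f_t` the term's frequency across all clusters. The toy `clusters` data and the helper names `c_tf_idf` and `top_keywords` are made up for illustration and are not part of the BERTopic API.

```python
import math
from collections import Counter

# Hypothetical toy data: documents already grouped into clusters
# (in the real pipeline these groups would come from UMAP + HDBSCAN).
clusters = {
    "space": ["the rocket launch reached orbit", "the rocket engine test"],
    "food": ["the pasta recipe needs garlic", "the garlic bread recipe"],
}

def c_tf_idf(clusters):
    """Score terms per cluster, treating each cluster as one big document."""
    # Term counts inside each cluster
    counts = {c: Counter(" ".join(docs).split()) for c, docs in clusters.items()}
    # Frequency of each term across all clusters
    total = Counter()
    for term_counts in counts.values():
        total.update(term_counts)
    # Average number of words per cluster (assumed meaning of "A")
    avg_words = sum(total.values()) / len(counts)
    scores = {}
    for c, term_counts in counts.items():
        n_words = sum(term_counts.values())
        scores[c] = {
            t: (f / n_words) * math.log(1 + avg_words / total[t])
            for t, f in term_counts.items()
        }
    return scores

def top_keywords(scores, k=2):
    """Return the k highest-scoring terms for each cluster."""
    return {c: sorted(s, key=s.get, reverse=True)[:k] for c, s in scores.items()}

print(top_keywords(c_tf_idf(clusters), k=2))
```

Terms that show up in every cluster (like "the") get a smaller log factor than terms concentrated in one cluster, which is why cluster-specific words tend to float to the top. On tiny data like this, common words can still rank fairly high, so treat it as an illustration of the idea only.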

