Web: https://www.reddit.com/r/LanguageTechnology/comments/wh2til/clustering_techniques/

Aug. 5, 2022, 6:34 p.m. | /u/nlp_learner23

Natural Language Processing reddit.com

I'm looking to convert clusters that contain phrases generated through an unsupervised clustering algorithm into a classification problem. The classification model performs poorly on some classes because the clusters are not cohesive. I have looked at methods such as silhouette coefficient and cosine similarity. Are there any methods to clean up clusters, merge similar clusters or even split clusters? I would like to improve the quality of the clusters to improve the accuracy of my classification model.

clustering languagetechnology

Engineering Manager, Machine Learning (Credit Engineering)

@ Affirm | Remote Poland

Sr Data Engineer

@ Rappi | [CO] Bogotá

Senior Analytics Engineer

@ GetGround | Porto

Senior Staff Software Engineer, Data Engineering

@ Galileo, Inc. | New York City or Remote

Data Engineer

@ Atlassian | Bengaluru, India

Data Engineer | Hybrid (Pune)

@ Velotio | Pune, Maharashtra, India