Feb. 21, 2024, 5:43 a.m. | Chakib Fettal, Lazhar Labiod, Mohamed Nadif

cs.LG updates on arXiv.org arxiv.org

arXiv:2402.12890v1 Announce Type: cross
Abstract: This paper explores an empirical approach to learn more discriminantive sentence representations in an unsupervised fashion. Leveraging semantic graph smoothing, we enhance sentence embeddings obtained from pretrained models to improve results for the text clustering and classification tasks. Our method, validated on eight benchmarks, demonstrates consistent improvements, showcasing the potential of semantic graph smoothing in improving sentence embeddings for the supervised and unsupervised document categorization tasks.

abstract arxiv benchmarks classification clustering consistent cs.cl cs.lg embeddings fashion graph improvements learn learn more paper pretrained models semantic tasks text type unsupervised via

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Business Data Scientist, gTech Ads

@ Google | Mexico City, CDMX, Mexico

Lead, Data Analytics Operations

@ Zocdoc | Pune, Maharashtra, India