Feb. 13, 2024, 5:49 a.m. | Daniel de S. Moraes Pedro T. C. Santos Polyana B. da Costa Matheus A. S. Pinto Ivan de J. P. Pinto \'Alvaro M.

cs.CL updates on arXiv.org arxiv.org

This work presents an unsupervised method for automatically constructing and expanding topic taxonomies using instruction-based fine-tuned LLMs (Large Language Models). We apply topic modeling and keyword extraction techniques to create initial topic taxonomies and LLMs to post-process the resulting terms and create a hierarchy. To expand an existing taxonomy with new terms, we use zero-shot prompting to find out where to add new nodes, which, to our knowledge, is the first work to present such an approach to taxonomy tasks. …

apply banking cs.ai cs.cl expansion extraction language language models large language large language models llms modeling process prompting retail tagging taxonomies terms topic modeling transactions unsupervised work zero-shot

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior ML Engineer

@ Carousell Group | Ho Chi Minh City, Vietnam

Data and Insight Analyst

@ Cotiviti | Remote, United States