March 27, 2024, 4:48 a.m. | Christopher Bagdon, Prathamesh Karmalker, Harsha Gurulingappa, Roman Klinger

cs.CL updates on arXiv.org arxiv.org

arXiv:2403.17612v1 Announce Type: new
Abstract: Labeling corpora constitutes a bottleneck to create models for new tasks or domains. Large language models mitigate the issue with automatic corpus labeling methods, particularly for categorical annotations. Some NLP tasks such as emotion intensity prediction, however, require text regression, but there is no work on automating annotations for continuous label assignments. Regression is considered more challenging than classification: The fact that humans perform worse when tasked to choose values from a rating scale lead …

abstract annotations arxiv categorical cs.cl domains emotion expert however intensity issue labeling language language models large language large language models modeling nlp prediction regression scaling tasks text type

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Reporting & Data Analytics Lead (Sizewell C)

@ EDF | London, GB

Data Analyst

@ Notable | San Mateo, CA