all AI news
Leveraging Weakly Annotated Data for Hate Speech Detection in Code-Mixed Hinglish: A Feasibility-Driven Transfer Learning Approach with Large Language Models
March 5, 2024, 2:52 p.m. | Sargam YadavDundalk Institute of Technology, Dundalk, Abhishek KaushikDundalk Institute of Technology, Dundalk, Kevin McDaidDundalk Institute of Techn
cs.CL updates on arXiv.org arxiv.org
Abstract: The advent of Large Language Models (LLMs) has advanced the benchmark in various Natural Language Processing (NLP) tasks. However, large amounts of labelled training data are required to train LLMs. Furthermore, data annotation and training are computationally expensive and time-consuming. Zero and few-shot learning have recently emerged as viable options for labelling data using large pre-trained models. Hate speech detection in mix-code low-resource languages is an active problem area where the use of LLMs has …
abstract advanced annotated data annotation arxiv benchmark code cs.ai cs.cl data data annotation detection hate speech hate speech detection language language models language processing large language large language models llms mixed natural natural language natural language processing nlp processing speech tasks train training training data transfer transfer learning type
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Business Data Analyst
@ Alstom | Johannesburg, GT, ZA