MALTO at SemEval-2024 Task 6: Leveraging Synthetic Data for LLM Hallucination Detection | allainews.com

March 5, 2024, 2:43 p.m. | Federico Borra, Claudio Savelli, Giacomo Rosso, Alkis Koudounas, Flavio Giobergia

cs.LG updates on arXiv.org arxiv.org

arXiv:2403.00964v1 Announce Type: cross
Abstract: In Natural Language Generation (NLG), contemporary Large Language Models (LLMs) face several challenges, such as generating fluent yet inaccurate outputs and reliance on fluency-centric metrics. This often leads to neural networks exhibiting "hallucinations". The SHROOM challenge focuses on automatically identifying these hallucinations in the generated text. To tackle these issues, we introduce two key components, a data augmentation pipeline incorporating LLM-assisted pseudo-labelling and sentence rephrasing, and a voting ensemble from three models pre-trained on Natural …

abstract arxiv challenge challenges cs.cl cs.lg data detection face hallucination hallucinations language language generation language models large language large language models leads llm llm hallucination llms metrics natural natural language natural language generation networks neural networks nlg reliance synthetic synthetic data type

More from arxiv.org / cs.LG updates on arXiv.org

(Accelerated) Noise-adaptive Stochastic Heavy-Ball Momentum 1 day, 17 hours ago | arxiv.org

abstract aim arxiv cs.lg +12

Nash Learning from Human Feedback 1 day, 17 hours ago | arxiv.org

abstract arxiv cs.ai cs.gt +20

GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs 1 day, 17 hours ago | arxiv.org

abstract arxiv become cs.cv +16

Trainwreck: A damaging adversarial attack on image classifiers 1 day, 17 hours ago | arxiv.org

adversarial arxiv classifiers cs.cr +5

Fast Controllable Diffusion Models for Undersampled MRI Reconstruction 1 day, 17 hours ago | arxiv.org

abstract acquisition arxiv cs.lg +13

MAD Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems 1 day, 17 hours ago | arxiv.org

abstract analysis arxiv beyond +24

From Classification to Segmentation with Explainable AI: A Study on Crack Detection and Growth Monitoring 1 day, 17 hours ago | arxiv.org

abstract arxiv classification cs.cv +22

Exploring Meta Information for Audio-based Zero-shot Bird Classification 1 day, 17 hours ago | arxiv.org

abstract advances arxiv audio +22

Occlusion-Aware Deep Convolutional Neural Network via Homogeneous Tanh-transforms for Face Parsing 1 day, 17 hours ago | arxiv.org

abstract arxiv become convolutional +16

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

View on ai-jobs.net

Sr. Data Operations

@ Carousell Group | West Jakarta, Indonesia

View on ai-jobs.net

Senior Analyst, Business Intelligence & Reporting

@ Deutsche Bank | Bucharest

View on ai-jobs.net

Business Intelligence Subject Matter Expert (SME) - Assistant Vice President

@ Deutsche Bank | Cary, 3000 CentreGreen Way

View on ai-jobs.net

Enterprise Business Intelligence Specialist

@ NAIC | Kansas City

View on ai-jobs.net

Senior Business Intelligence (BI) Developer - Associate

@ Deutsche Bank | Cary, 3000 CentreGreen Way

View on ai-jobs.net