Improving Grammatical Error Correction via Contextual Data Augmentation

June 26, 2024, 4:42 a.m. | Yixuan Wang, Baoxin Wang, Yijun Liu, Qingfu Zhu, Dayong Wu, Wanxiang Che

Source: cs.CL updates on arXiv.org

arXiv:2406.17456v1 Announce Type: new
Abstract: Data augmentation with synthetic data is now widely used in Grammatical Error Correction (GEC) to alleviate data scarcity. However, such synthetic data is mainly used in the pre-training phase rather than the data-limited fine-tuning phase, due to its inconsistent error distribution and noisy labels. In this paper, we propose a synthetic data construction method based on contextual augmentation, which can ensure an efficient augmentation of the original data with …


