March 11, 2024, 4:47 a.m. | Agnes Luhtaru, Taido Purason, Martin Vainikko, Maksym Del, Mark Fishel

cs.CL updates on arXiv.org arxiv.org

arXiv:2403.05493v1 Announce Type: new
Abstract: This study explores enhancing grammatical error correction (GEC) through artificial error generation (AEG) using language models (LMs). Specifically, we fine-tune Llama 2-based LMs for error generation and find that this approach yields synthetic errors akin to human errors. Next, we train GEC Llama models with the help of these artificial errors and outperform previous state-of-the-art error correction models, with gains ranging between 0.8 and 6 F0.5 points across all tested languages (German, Ukrainian, and Estonian). …

abstract artificial arxiv cs.cl error error correction errors gec human language language models learn llama llama 2 llama models llamas lms next study synthetic through train type

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US