April 4, 2024, 4:47 a.m. | Vandan Mujadia, Pruthwik Mishra, Arafat Ahsan, Dipti Misra Sharma

cs.CL updates on arXiv.org arxiv.org

arXiv:2404.02512v1 Announce Type: new
Abstract: With the primary focus on evaluating the effectiveness of large language models for automatic reference-less translation assessment, this work presents our experiments on mimicking human direct assessment to evaluate the quality of translations in English and Indian languages. We constructed a translation evaluation task where we performed zero-shot learning, in-context example-driven learning, and fine-tuning of large language models to provide a score out of 100, where 100 represents a perfect translation and 1 represents a …

abstract arxiv assessment cs.cl english evaluation focus human indian indian languages language language model language models languages large language large language model large language models quality reference translation translations type work

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US