Towards Large Language Model driven Reference-less Translation Evaluation for English and Indian Languages

April 4, 2024, 4:47 a.m. | Vandan Mujadia, Pruthwik Mishra, Arafat Ahsan, Dipti Misra Sharma

cs.CL updates on arXiv.org

arXiv:2404.02512v1 Announce Type: new
Abstract: With the primary focus on evaluating the effectiveness of large language models for automatic reference-less translation assessment, this work presents our experiments on mimicking human direct assessment to evaluate the quality of translations in English and Indian languages. We constructed a translation evaluation task where we performed zero-shot learning, in-context example-driven learning, and fine-tuning of large language models to provide a score out of 100, where 100 represents a perfect translation and 1 represents a …
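The zero-shot and in-context setups described in the abstract can be illustrated with a small prompt-building sketch. This is not the authors' implementation: the prompt wording, the function names `build_prompt` and `parse_score`, and the clamping of replies to the 1 to 100 range are all assumptions made for illustration, since the abstract does not give the actual prompts or post-processing.

```python
import re
from typing import Optional, Sequence, Tuple

# Hypothetical instruction text; the paper's real prompt is not shown in the abstract.
INSTRUCTION = (
    "Rate the quality of the following translation from {src_lang} to {tgt_lang} "
    "on a scale of 1 to 100, where 100 is a perfect translation. "
    "Reply with the score only."
)

# (source sentence, translation, human direct-assessment score)
Example = Tuple[str, str, int]


def build_prompt(
    source: str,
    translation: str,
    src_lang: str,
    tgt_lang: str,
    examples: Sequence[Example] = (),
) -> str:
    """Build a reference-less scoring prompt.

    With no examples this is a zero-shot prompt; with examples it becomes
    an in-context (few-shot) prompt. No reference translation is used.
    """
    parts = [INSTRUCTION.format(src_lang=src_lang, tgt_lang=tgt_lang)]
    for ex_src, ex_tgt, ex_score in examples:
        parts.append(f"Source: {ex_src}\nTranslation: {ex_tgt}\nScore: {ex_score}")
    # The final block is left open so the model completes the score.
    parts.append(f"Source: {source}\nTranslation: {translation}\nScore:")
    return "\n\n".join(parts)


def parse_score(reply: str) -> Optional[int]:
    """Extract the first integer from a model reply and clamp it to 1..100.

    Returns None when the reply contains no digits. Clamping is an
    assumption of this sketch, not something stated in the abstract.
    """
    match = re.search(r"\d+", reply)
    if match is None:
        return None
    return max(1, min(100, int(match.group())))
```

The resulting string would be sent to whichever language model is being evaluated, and `parse_score` applied to its reply; the model call itself is deliberately left out because the abstract does not name a specific model or API.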
