On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data. (arXiv:2205.06350v2 [cs.CL] UPDATED)

Nov. 15, 2022, 2:16 a.m. | Kabir Ahuja, Monojit Choudhury, Sandipan Dandapat

cs.CL updates on arXiv.org arxiv.org

Borrowing ideas from production functions in micro-economics, in this
paper we introduce a framework to systematically evaluate the performance and
cost trade-offs between machine-translated and manually created labelled data
for task-specific fine-tuning of massively multilingual language models. We
illustrate the effectiveness of our framework through a case study on the
TyDIQA-GoldP dataset. One of the interesting conclusions of the study is that
if the cost of machine translation is greater than zero, the optimal
performance at least cost is always achieved …


