all AI news
Efficacy of ByteT5 in Multilingual Translation of Biblical Texts for Underrepresented Languages
May 24, 2024, 4:45 a.m. | Corinne Aars, Lauren Adams, Xiaokan Tian, Zhaoyu Wang, Colton Wismer, Jason Wu, Pablo Rivas, Korn Sooksatra, Matthew Fendt
cs.LG updates on arXiv.org arxiv.org
Abstract: This study presents the development and evaluation of a ByteT5-based multilingual translation model tailored for translating the Bible into underrepresented languages. Utilizing the comprehensive Johns Hopkins University Bible Corpus, we trained the model to capture the intricate nuances of character-based and morphologically rich languages. Our results, measured by the BLEU score and supplemented with sample translations, suggest the model can improve accessibility to sacred texts. It effectively handles the distinctive biblical lexicon and structure, thus …
abstract arxiv cs.cl cs.lg development evaluation johns hopkins university languages multilingual study translation type university
More from arxiv.org / cs.LG updates on arXiv.org
Machine-learned models for magnetic materials
2 days, 13 hours ago |
arxiv.org
Revisiting RIP guarantees for sketching operators on mixture models
2 days, 13 hours ago |
arxiv.org
Jobs in AI, ML, Big Data
Senior Data Engineer
@ Displate | Warsaw
Junior Data Analyst - ESG Data
@ Institutional Shareholder Services | Mumbai
Intern Data Driven Development in Sensor Fusion for Autonomous Driving (f/m/x)
@ BMW Group | Munich, DE
Senior MLOps Engineer, Machine Learning Platform
@ GetYourGuide | Berlin
Data Engineer, Analytics
@ Meta | Menlo Park, CA
Data Engineer
@ Meta | Menlo Park, CA