Improving Neural Machine Translation by Denoising Training. (arXiv:2201.07365v1 [cs.CL])
cs.CL updates on arXiv.org arxiv.org
We present a simple and effective pretraining strategy, Denoising Training
(DoT), for neural machine translation. Specifically, we update the model
parameters with source- and target-side denoising tasks at the early stage and
then tune the model normally. Notably, our approach adds no parameters or
training steps, requiring only the parallel data. Experiments show that DoT
consistently improves neural machine translation performance across 12
bilingual and 16 multilingual directions (data sizes ranging from 80K to 20M).
In addition, …
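The two-stage recipe described above can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: the `model.loss` interface, the token-masking noise scheme, and the `warmup_steps` cutoff are all assumptions made for the example.

```python
import random

def add_noise(tokens, mask_rate=0.15, mask_token="<mask>", seed=None):
    """Randomly replace a fraction of tokens with a mask token.

    This is one simple denoising corruption; the paper's exact noise
    functions may differ (hypothetical choice for illustration).
    """
    rng = random.Random(seed)
    return [mask_token if rng.random() < mask_rate else t for t in tokens]

def training_step(model, src, tgt, step, warmup_steps=1000):
    """One training step of the two-stage schedule.

    Early stage: reconstruct clean source and target from their noised
    versions (source- and target-side denoising). After warmup: train
    on the standard translation objective. `model.loss(inputs, targets)`
    is an assumed interface returning a scalar loss.
    """
    if step < warmup_steps:
        return (model.loss(add_noise(src), src) +
                model.loss(add_noise(tgt), tgt))
    return model.loss(src, tgt)
```

Because the denoising stage reuses the same model and the same parallel corpus, this schedule adds no parameters and no extra training steps, matching the claim in the abstract.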