Jan. 31, 2024, 4:41 p.m. | Shira Wein, Nathan Schneider

cs.CL updates on arXiv.org arxiv.org

Translated texts bear several hallmarks distinct from texts originating in
the language. Though individual translated texts are often fluent and preserve
meaning, at a large scale, translated texts have statistical tendencies which
distinguish them from text originally written in the language
("translationese") and can affect model performance. We frame the novel task of
translationese reduction and hypothesize that Abstract Meaning Representation
(AMR), a graph-based semantic representation which abstracts away from the
surface form, can be used as an interlingua to …

abstract arxiv cs.cl language lost meaning performance representation scale statistical text them translated translation

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US