all AI news
How can I crowdsource speech/translation data for an endangered Semitic language?
May 25, 2022, 3:50 a.m. | /u/Foofalo
Natural Language Processing www.reddit.com
**Mission:** I am trying to crowdsources speech and translation data for an endangered Semitic language: Neo-Aramaic
**What I have done already:** There are many many dialects of spoken Neo-Aramaic (not including Classical Syriac). Documenting all of them is important. But for the most common dialect specifically (Urmi), I have a \~100 page bilingual English/Assyrian corpus in the desired phonetic Romanized [alphabet](https://nena.ames.cam.ac.uk/audio/200/). I have also created an ASR model with 12% CER to semi-automate the transcription of future unlabeled …
More from www.reddit.com / Natural Language Processing
Jobs in AI, ML, Big Data
Senior ML Researcher - 3D Geometry Processing | 3D Shape Generation | 3D Mesh Data
@ Promaton | Europe
Principal Data Engineer
@ RS21 | Remote
SQL/Power BI Developer
@ ICF | Virginia Remote Office (VA99)
Senior Machine Learning Engineer (Canada Remote)
@ Fullscript | Ottawa, ON
Software Engineer - MLOps.
@ Renesas Electronics | Toyosu, Japan
Junior Data Scientist / Artificial Intelligence consultant
@ Deloitte | Luxembourg, LU