April 26, 2024, 4:47 a.m. | Chanho Park, Mingjie Chen, Thomas Hain

cs.CL updates on arXiv.org arxiv.org

arXiv:2404.16743v1 Announce Type: new
Abstract: Word error rate (WER) is a metric used to evaluate the quality of transcriptions produced by Automatic Speech Recognition (ASR) systems. In many applications, it is of interest to estimate WER given a pair of a speech utterance and a transcript. Previous work on WER estimation focused on building models that are trained with a specific ASR system in mind (referred to as ASR system-dependent). These are also domain-dependent and inflexible in real-world applications. In …

abstract applications arxiv asr automatic speech recognition cs.cl cs.sd eess.as error independent quality rate recognition speech speech recognition systems type word work

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Machine Learning Research Scientist

@ d-Matrix | San Diego, Ca