Jan. 10, 2022, 2:10 a.m. | Evelina Bakhturina, Vitaly Lavrukhin, Boris Ginsburg

cs.CL updates on arXiv.org arxiv.org

Automatic Speech Recognition and Text-to-Speech systems are primarily trained
in a supervised fashion and require high-quality, accurately labeled speech
datasets. In this work, we examine common problems with speech data and
introduce a toolbox for the construction and interactive error analysis of
speech datasets. The construction tool is based on K\"urzinger et al. work,
and, to the best of our knowledge, the dataset exploration tool is the world's
first open-source tool of this kind. We demonstrate how to apply these …

analysis arxiv construction datasets speech

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Stagista Technical Data Engineer

@ Hager Group | BRESCIA, IT

Data Analytics - SAS, SQL - Associate

@ JPMorgan Chase & Co. | Mumbai, Maharashtra, India