March 26, 2024, 2:43 p.m. |

Mozilla Foundation Blog foundation.mozilla.org





We’re changing the clip length for donated speech clips in the Common Voice dataset. As a first step, we’ve expanded the limits our platform places on recorded clips to 15 seconds, from the original 10 seconds. This will allow future contributors to donate longer clips, while giving dataset users longer clips for development or research. We’ve expanded this as part of ongoing discussions with our contributor and dataset user community.

When we first set the clip length, the 10 second …

clip contributors dataset development expansion future giving platform speech voice will

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Principal Data Engineering Manager

@ Microsoft | Redmond, Washington, United States

Machine Learning Engineer

@ Apple | San Diego, California, United States