Bengali Common Voice Speech Dataset for Automatic Speech Recognition. (arXiv:2206.14053v1 [cs.CL]) | allainews.com

June 29, 2022, 1:12 a.m. | Samiul Alam, Asif Sushmit, Zaowad Abdullah, Shahrin Nakkhatra, MD. Nazmuddoha Ansary, Syed Mobassir Hossen, Sazia Morshed Mehnaz, Tahsin Reasat, Ahmed

cs.CL updates on arXiv.org arxiv.org

Bengali is one of the most spoken languages in the world with over 300
million speakers globally. Despite its popularity, research into the
development of Bengali speech recognition systems is hindered due to the lack
of diverse open-source datasets. As a way forward, we have crowdsourced the
Bengali Common Voice Speech Dataset, which is a sentence-level automatic speech
recognition corpus. Collected on the Mozilla Common Voice platform, the dataset
is part of an ongoing campaign that has led to the …

arxiv automatic speech recognition dataset speech speech recognition voice

More from arxiv.org / cs.CL updates on arXiv.org

A Text Classification Framework for Simple and Effective Early Depression Detection Over Social Media Streams 23 hours ago | arxiv.org

abstract arxiv build classification +22

A Survey on Prompting Techniques in LLMs 23 hours ago | arxiv.org

abstract arxiv autoregressive cs.ai +24

Enabling On-Device Large Language Model Personalization with Self-Supervised Data Selection and Synthesis 23 hours ago | arxiv.org

abstract arxiv conversation cs.cl +21

ML-Bench: Evaluating Large Language Models for Code Generation in Repository-Level Machine Learning Tasks 23 hours ago | arxiv.org

arxiv code code generation cs.ai +9

Strings from the Library of Babel: Random Sampling as a Strong Baseline for Prompt Optimisation 23 hours ago | arxiv.org

abstract arxiv cs.ai cs.cl +16

Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study 23 hours ago | arxiv.org

abstract arxiv case case study +19

Formal Aspects of Language Modeling 23 hours ago | arxiv.org

abstract artificial artificial intelligence arxiv +24

Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model 23 hours ago | arxiv.org

abstract advanced arxiv challenges +24

Predicting Emergent Abilities with Infinite Resolution Evaluation 23 hours ago | arxiv.org

abstract arxiv cs.cl evaluation +19

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

View on ai-jobs.net

Enterprise Data Architect

@ Pathward | Remote

View on ai-jobs.net

Diagnostic Imaging Information Systems (DIIS) Technologist

@ Nova Scotia Health Authority | Halifax, NS, CA, B3K 6R8

View on ai-jobs.net

Intern Data Scientist - Residual Value Risk Management (f/m/d)

@ BMW Group | Munich, DE

View on ai-jobs.net

Analytics Engineering Manager

@ PlayStation Global | United Kingdom, London

View on ai-jobs.net

Junior Insight Analyst (PR&Comms)

@ Signal AI | Lisbon, Lisbon, Portugal

View on ai-jobs.net