Automatic Speech Recognition Datasets in Cantonese Language: A Survey and a New Dataset. (arXiv:2201.02419v1 [cs.CL]) | allainews.com

Jan. 10, 2022, 2:10 a.m. | Tiezheng Yu, Rita Frieske, Peng Xu, Samuel Cahyawijaya, Cheuk Tung Shadow Yiu, Holy Lovenia, Wenliang Dai, Elham J. Barezi, Qifeng Chen, Xiaojuan Ma,

cs.CL updates on arXiv.org arxiv.org

Automatic speech recognition (ASR) on low resource languages improves access
of linguistic minorities to technological advantages provided by Artificial
Intelligence (AI). In this paper, we address a problem of data scarcity of Hong
Kong Cantonese language by creating a new Cantonese dataset. Our dataset,
Multi-Domain Cantonese Corpus (MDCC), consists of 73.6 hours of clean read
speech paired with transcripts, collected from Cantonese audiobooks from Hong
Kong. It combines philosophy, politics, education, culture, lifestyle and
family domains, covering a wide range …

arxiv dataset datasets language speech speech recognition survey

More from arxiv.org / cs.CL updates on arXiv.org

LLMs for Science: Usage for Code Generation and Data Analysis 20 hours ago | arxiv.org

abstract analysis arxiv become +26

VAL: Interactive Task Learning with GPT Dialog Parsing 20 hours ago | arxiv.org

abstract acquisition arxiv box +22

Convergences and Divergences between Automatic Assessment and Human Evaluation: Insights from Comparing ChatGPT-Generated Translation and … 20 hours ago | arxiv.org

abstract arxiv assessment automated +23

Some things are more CRINGE than others: Iterative Preference Optimization with the Pairwise Cringe Loss 20 hours ago | arxiv.org

abstract arxiv binary cs.ai +13

DBCopilot: Scaling Natural Language Querying to Massive Databases 20 hours ago | arxiv.org

abstract advances arxiv challenges +31

ARN: Analogical Reasoning on Narratives 20 hours ago | arxiv.org

abstract analogy arxiv cognitive +17

Applying BioBERT to Extract Germline Gene-Disease Associations for Building a Knowledge Graph from the Biomedical … 20 hours ago | arxiv.org

abstract arxiv biomedical building +24

Learning the meanings of function words from grounded language using a visual question answering model 20 hours ago | arxiv.org

abstract acquisition arxiv children +17

RETVec: Resilient and Efficient Text Vectorizer 20 hours ago | arxiv.org

arxiv cs.ai cs.cl resilient +2

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Social Insights & Data Analyst (Freelance)

@ Media.Monks | Jakarta

View on ai-jobs.net

Cloud Data Engineer

@ Arkatechture | Portland, ME, USA

View on ai-jobs.net