Feb. 25, 2022, 2:10 a.m. | Herman Kamper

cs.CL updates on arXiv.org

Recent work on unsupervised speech segmentation has used self-supervised
models with a phone segmentation module and a word segmentation module that are
trained jointly. This paper compares this joint methodology with an older idea:
bottom-up phone-like unit discovery is performed first, and symbolic word
segmentation is then performed on top of the discovered units (without
influencing the lower level). I specifically describe a duration-penalized
dynamic programming (DPDP) procedure that can be used for either phone or word
segmentation by changing …
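
The abstract is cut off before it says what is changed between the phone and word cases, so the following is only a generic sketch of duration-penalized dynamic programming, not the paper's implementation. It assumes a segmentation is scored as the sum, over segments, of a per-segment cost plus a duration penalty, and that the two are supplied as functions. The names `dp_segment`, `seg_cost`, `dur_penalty`, and `max_dur`, the squared-deviation cost, and the linear penalty are all hypothetical choices made for illustration.

```python
from typing import Callable, List, Tuple


def dp_segment(
    n: int,
    seg_cost: Callable[[int, int], float],
    dur_penalty: Callable[[int], float],
    max_dur: int,
) -> Tuple[List[int], float]:
    """Split frames 0..n into segments minimizing the summed
    seg_cost(start, end) + dur_penalty(end - start).

    Returns the segment end indices (exclusive) and the total score.
    """
    best = [0.0] + [float("inf")] * n  # best[t]: lowest score for frames [0, t)
    back = [0] * (n + 1)               # back[t]: start of the last segment ending at t
    for t in range(1, n + 1):
        for s in range(max(0, t - max_dur), t):
            score = best[s] + seg_cost(s, t) + dur_penalty(t - s)
            if score < best[t]:
                best[t], back[t] = score, s
    # Trace the stored start indices back from the final frame.
    bounds = []
    t = n
    while t > 0:
        bounds.append(t)
        t = back[t]
    bounds.reverse()
    return bounds, best[n]


# Toy 1-D "features" with an obvious change after the third frame.
frames = [0.0, 0.1, 0.0, 5.0, 5.1, 5.0]


def sq_dev(s: int, t: int) -> float:
    # Assumed cost: squared deviation of the frames from their segment mean.
    seg = frames[s:t]
    mean = sum(seg) / len(seg)
    return sum((x - mean) ** 2 for x in seg)


# Assumed penalty: reward longer segments linearly, with weight 0.5.
bounds, total = dp_segment(len(frames), sq_dev, lambda d: -0.5 * (d - 1), max_dur=6)
print(bounds)  # [3, 6]
```

Without the penalty, one-frame segments would each cost zero and the procedure would place a boundary after every frame. The penalty term is what pushes it toward longer units. In this sketch, swapping in a different `seg_cost` and `dur_penalty` is the only thing that would distinguish one segmentation level from another.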

