all AI news
Word Segmentation on Discovered Phone Units with Dynamic Programming and Self-Supervised Scoring. (arXiv:2202.11929v1 [cs.CL])
Feb. 25, 2022, 2:10 a.m. | Herman Kamper
cs.CL updates on arXiv.org arxiv.org
Recent work on unsupervised speech segmentation has used self-supervised
models with a phone segmentation module and a word segmentation module that are
trained jointly. This paper compares this joint methodology with an older idea:
bottom-up phone-like unit discovery is performed first, and symbolic word
segmentation is then performed on top of the discovered units (without
influencing the lower level). I specifically describe a duration-penalized
dynamic programming (DPDP) procedure that can be used for either phone or word
segmentation by changing …
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Senior AI & Data Engineer
@ Bertelsmann | Kuala Lumpur, 14, MY, 50400
Analytics Engineer
@ Reverse Tech | Philippines - Remote