July 5, 2022, 1:12 a.m. | Guangzhi Sun, Chao Zhang, Philip C. Woodland

cs.CL updates on arXiv.org

Incorporating biasing words obtained as contextual knowledge is critical for
many automatic speech recognition (ASR) applications. This paper proposes the
use of graph neural network (GNN) encodings in a tree-constrained pointer
generator (TCPGen) component for end-to-end contextual ASR. By encoding the
biasing words in the prefix tree with a tree-based GNN, lookahead for future
wordpieces in end-to-end ASR decoding is achieved at each tree node by
incorporating information about all wordpieces on the tree branches rooted at
it, which allows a …
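
The prefix-tree lookahead described in the abstract can be illustrated with a small sketch. This is not the paper's TCPGen or its GNN: the learned node encoding is replaced here by a plain set union over each node's subtree, purely to show how a node can carry information about every wordpiece on the branches below it. The names Node, build_prefix_tree and summarize, and the example wordpiece splits, are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    # Maps a wordpiece to the child node reached by emitting it.
    children: dict = field(default_factory=dict)
    # True if the path from the root to this node spells a full biasing word.
    is_word_end: bool = False
    # Stand-in for a learned encoding: all wordpieces on branches below.
    lookahead: frozenset = frozenset()


def build_prefix_tree(biasing_words):
    """Build a prefix tree from biasing words given as wordpiece lists."""
    root = Node()
    for pieces in biasing_words:
        node = root
        for piece in pieces:
            node = node.children.setdefault(piece, Node())
        node.is_word_end = True
    return root


def summarize(node):
    """Bottom-up pass: each node collects the wordpieces in its subtree.

    A tree-based GNN would combine child vectors with learned weights;
    a set union is used here only as an assumption-free placeholder.
    """
    below = set()
    for piece, child in node.children.items():
        below.add(piece)
        below |= summarize(child)
    node.lookahead = frozenset(below)
    return node.lookahead


# Hypothetical wordpiece segmentations of three biasing words.
words = [["tur", "ner"], ["tur", "ing"], ["wood", "land"]]
root = build_prefix_tree(words)
summarize(root)
tur = root.children["tur"]
```

After the bottom-up pass, the node reached by "tur" knows that "ner" and "ing" can follow, which is the kind of future-wordpiece information the abstract refers to as lookahead.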

