all AI news
Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation. (arXiv:2204.08920v1 [cs.CL])
cs.CL updates on arXiv.org arxiv.org
Although Transformers have gained success in several speech processing tasks
like spoken language understanding (SLU) and speech translation (ST), achieving
online processing while keeping competitive performance is still essential for
real-world interaction. In this paper, we take the first step on streaming SLU
and simultaneous ST using a blockwise streaming Transformer, which is based on
contextual block processing and blockwise synchronous beam search. Furthermore,
we design an automatic speech recognition (ASR)-based intermediate loss
regularization for the streaming SLU task to …
arxiv language speech spoken language understanding streaming transformer translation understanding