all AI news
End-to-End Neural Audio Coding for Real-Time Communications. (arXiv:2201.09429v2 [cs.SD] UPDATED)
Jan. 26, 2022, 2:11 a.m. | Xue Jiang, Xiulian Peng, Chengyu Zheng, Huaying Xue, Yuan Zhang, Yan Lu
cs.LG updates on arXiv.org arxiv.org
Deep-learning based methods have shown their advantages in audio coding over
traditional ones but limited attention has been paid on real-time
communications (RTC). This paper proposes the TFNet, an end-to-end neural audio
codec with low latency for RTC. It takes an encoder-temporal filtering-decoder
paradigm that seldom being investigated in audio coding. An interleaved
structure is proposed for temporal filtering to capture both short-term and
long-term temporal dependencies. Furthermore, with end-to-end optimization, the
TFNet is jointly optimized with speech enhancement and …
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Senior AI & Data Engineer
@ Bertelsmann | Kuala Lumpur, 14, MY, 50400
Analytics Engineer
@ Reverse Tech | Philippines - Remote