all AI news
DeepFilterNet2: Towards Real-Time Speech Enhancement on Embedded Devices for Full-Band Audio. (arXiv:2205.05474v1 [eess.AS])
cs.LG updates on arXiv.org arxiv.org
Deep learning-based speech enhancement has seen huge improvements and
recently also expanded to full band audio (48 kHz). However, many approaches
have a rather high computational complexity and require big temporal buffers
for real time usage e.g. due to temporal convolutions or attention. Both make
those approaches not feasible on embedded devices. This work further extends
DeepFilterNet, which exploits harmonic structure of speech allowing for
efficient speech enhancement (SE). Several optimizations in the training
procedure, data augmentation, and network structure …
arxiv audio devices embedded embedded devices real-time speech time