DeepFilterNet2: Towards Real-Time Speech Enhancement on Embedded Devices for Full-Band Audio. (arXiv:2205.05474v1 [eess.AS])

May 12, 2022, 1:11 a.m. | Hendrik Schröter, Alberto N. Escalante-B., Tobias Rosenkranz, Andreas Maier

cs.LG updates on arXiv.org

Deep learning-based speech enhancement has seen huge improvements and
has recently also expanded to full-band audio (48 kHz). However, many approaches
have a rather high computational complexity and require large temporal buffers
for real-time usage, e.g., due to temporal convolutions or attention. Both make
those approaches infeasible on embedded devices. This work further extends
DeepFilterNet, which exploits the harmonic structure of speech, allowing for
efficient speech enhancement (SE). Several optimizations in the training
procedure, data augmentation, and network structure …

