CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement. (arXiv:2209.11112v2 [cs.SD] UPDATED)

Sept. 26, 2022, 1:12 a.m. | Sherif Abdulatif, Ruizhe Cao, Bin Yang

cs.LG updates on arXiv.org

Convolution-augmented transformers (Conformers) have recently been proposed for
various speech-domain applications, such as automatic speech recognition (ASR)
and speech separation, as they can capture both local and global dependencies.
In this paper, we propose a conformer-based metric generative adversarial
network (CMGAN) for speech enhancement (SE) in the time-frequency (TF) domain.
The generator encodes the magnitude and complex spectrogram information using
two-stage conformer blocks to model both time and frequency dependencies. The
decoder then decouples the estimation into a magnitude mask decoder …
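
As a rough illustration of the input side described above, the following sketch builds a three-channel array (magnitude, real part, imaginary part) from a complex spectrogram and shows the two views a time stage and a frequency stage could operate on. Everything here is an assumption made for illustration: the function names (`stft`, `generator_input`, `time_view`, `freq_view`), the window length of 400 samples, and the hop of 100 samples do not come from the abstract, and the conformer blocks themselves are not implemented.

```python
import numpy as np

def stft(signal, n_fft=400, hop=100):
    """Naive Hann-windowed STFT; returns a complex array of shape (T, F)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack([signal[i * hop : i * hop + n_fft] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=-1)

def generator_input(spec):
    """Stack magnitude, real and imaginary parts into a (3, T, F) array."""
    return np.stack([np.abs(spec), spec.real, spec.imag])

def time_view(x):
    """(C, T, F) -> (F, T, C): each frequency bin is a sequence over time."""
    return x.transpose(2, 1, 0)

def freq_view(x):
    """(C, T, F) -> (T, F, C): each time frame is a sequence over frequency."""
    return x.transpose(1, 2, 0)

# Placeholder signal: 1 s of white noise at an assumed 16 kHz sampling rate.
rng = np.random.default_rng(0)
noisy = rng.standard_normal(16000)

spec = stft(noisy)
x = generator_input(spec)
print(x.shape, time_view(x).shape, freq_view(x).shape)
```

In this layout a sequence model applied to `time_view(x)` attends along the time axis for every frequency bin, and one applied to `freq_view(x)` attends along the frequency axis for every frame, which is one plausible reading of modelling "both time and frequency dependencies" in two stages.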
