March 5, 2024, 2:45 p.m. | Ruizhe Cao, Sherif Abdulatif, Bin Yang

cs.LG updates on arXiv.org arxiv.org

arXiv:2203.15149v4 Announce Type: replace-cross
Abstract: Recently, convolution-augmented transformer (Conformer) has achieved promising performance in automatic speech recognition (ASR) and time-domain speech enhancement (SE), as it can capture both local and global dependencies in the speech signal. In this paper, we propose a conformer-based metric generative adversarial network (CMGAN) for SE in the time-frequency (TF) domain. In the generator, we utilize two-stage conformer blocks to aggregate all magnitude and complex spectrogram information by modeling both time and frequency dependencies. The estimation …

abstract adversarial arxiv asr automatic speech recognition convolution cs.ai cs.lg cs.sd dependencies domain eess.as gan generative generative adversarial network global network paper performance recognition signal speech speech recognition transformer type

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US