Oct. 26, 2022, 1:12 a.m. | Gasser Elbanna, Neil Scheidwasser-Clow, Mikolaj Kegler, Pierre Beckmann, Karl El Hajal, Milos Cernak

cs.LG updates on arXiv.org arxiv.org

Methods for extracting audio and speech features have been studied since
pioneering work on spectrum analysis decades ago. Recent efforts are guided by
the ambition to develop general-purpose audio representations. For example,
deep neural networks can extract optimal embeddings if they are trained on
large audio datasets. This work extends existing methods based on
self-supervised learning by bootstrapping, proposes various encoder
architectures, and explores the effects of using different pre-training
datasets. Lastly, we present a novel training framework to come …

arxiv bootstrapping speech

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US