Web: https://www.reddit.com/r/MachineLearning/comments/razoa5/d_why_is_audio_so_far_behind_other_ml_application/

Dec. 7, 2021, 1:53 p.m. | /u/Crookedpenguin

Machine Learning reddit.com

I'd like to gain some intuition, and some actual insight from anyone who has worked on audio, into why we haven't seen major breakthroughs in audio ML (tracking and localization) like those we saw in image processing and NLP. Of course there is the issue of data and annotation: besides being laborious, the labeling task is also hard to define (e.g., how do you label a conversation among multiple people at a dinner table?), but I would assume that large corporations would attempt such a project if it were needed.

It seems to me …

audio image processing machinelearning ml nlp
