Multimodal technique for analyzing audio and visual data improves performance of machine-learning models | allainews.com

June 6, 2023, 5:08 p.m. |

News on Artificial Intelligence and Machine Learning techxplore.com

Researchers from MIT, the MIT-IBM Watson AI Lab, IBM Research, and elsewhere have developed a new technique for analyzing unlabeled audio and visual data that could improve the performance of machine-learning models used in applications like speech recognition and object detection. The work, for the first time, combines two architectures of self-supervised learning, contrastive learning and masked data modeling, in an effort to scale machine-learning tasks like event classification in single- and multimodal data without the need for annotation, thereby …

applications audio computer sciences data detection ibm ibm research lab machine mit mit-ibm watson ai lab multimodal performance recognition research researchers speech speech recognition visual data watson work

More from techxplore.com / News on Artificial Intelligence and Machine Learning

How artificial intelligence can transform U.S. energy infrastructure 2 days, 4 hours ago | techxplore.com

artificial artificial intelligence carbon change +15

Deepfake of principal's voice is the latest case of AI being used for harm 2 days, 6 hours ago | techxplore.com

artificial artificial intelligence case deepfake +10

Financial Times enters ChatGPT content deal 2 days, 6 hours ago | techxplore.com

chatbot chatgpt deal financial +7

Researchers create verification techniques to increase security in AI and image processing 2 days, 6 hours ago | techxplore.com

computing create efficiency europe +14

Researchers use ChatGPT for choreographies with flying robots 2 days, 6 hours ago | techxplore.com

chatgpt drones filter flying +14

Microsoft expands its AI empire abroad 5 days, 18 hours ago | techxplore.com

artificial artificial intelligence billion business +8

Microsoft claims that small, localized language models can be powerful as well 1 week ago | techxplore.com

ai language models arxiv business cost +13

Research team develops novel metric for evaluation of risk-return tradeoff in off-policy evaluation 1 week ago | techxplore.com

decision error evaluation however +20

A new framework to generate human motions from language prompts 1 week, 1 day ago | techxplore.com

advanced algorithms become compiling +14

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Data Scientist

@ Publicis Groupe | New York City, United States

View on ai-jobs.net

Bigdata Cloud Developer - Spark - Assistant Manager

@ State Street | Hyderabad, India

View on ai-jobs.net