An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits

March 26, 2024, 4:49 a.m. | Kai Li, Fenghua Xie, Hang Chen, Kexin Yuan, Xiaolin Hu

cs.CV updates on arXiv.org

arXiv:2212.10744v2 Announce Type: replace-cross
Abstract: Audio-visual approaches involving visual inputs have laid the foundation for recent progress in speech separation. However, the optimization of the concurrent usage of auditory and visual inputs is still an active research area. Inspired by the cortico-thalamo-cortical circuit, in which the sensory processing mechanisms of different modalities modulate one another via the non-lemniscal sensory thalamus, we propose a novel cortico-thalamo-cortical neural network (CTCNet) for audio-visual speech separation (AVSS). First, the CTCNet learns hierarchical auditory and …


