May 1, 2024, 4:41 a.m. | Paul Pu Liang

cs.LG updates on arXiv.org arxiv.org

arXiv:2404.18976v1 Announce Type: new
Abstract: Building multisensory AI systems that learn from multiple sensory inputs such as text, speech, video, real-world sensors, wearable devices, and medical data holds great promise for impact in many scientific areas with practical benefits, such as in supporting human health and well-being, enabling multimedia content processing, and enhancing real-world autonomous agents. By synthesizing a range of theoretical frameworks and application domains, this thesis aims to advance the machine learning foundations of multisensory AI. In the …

abstract ai systems artificial artificial intelligence arxiv autonomous benefits building cs.ai cs.cl cs.cv cs.lg cs.mm data devices enabling health human impact inputs intelligence learn medical medical data multimedia multiple practical processing scientific sensors sensory speech systems text type video wearable wearable devices world

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York