all AI news
A Unified Optimal Transport Framework for Cross-Modal Retrieval with Noisy Labels
March 21, 2024, 4:45 a.m. | Haochen Han, Minnan Luo, Huan Liu, Fang Nan
cs.CV updates on arXiv.org arxiv.org
Abstract: Cross-modal retrieval (CMR) aims to establish interaction between different modalities, among which supervised CMR is emerging due to its flexibility in learning semantic category discrimination. Despite the remarkable performance of previous supervised CMR methods, much of their success can be attributed to the well-annotated data. However, even for unimodal data, precise annotation is expensive and time-consuming, and it becomes more challenging with the multimodal scenario. In practice, massive multimodal data are collected from the Internet …
abstract arxiv cs.cv cs.ir cs.mm discrimination flexibility framework labels modal performance retrieval semantic success transport type
More from arxiv.org / cs.CV updates on arXiv.org
Compact 3D Scene Representation via Self-Organizing Gaussian Grids
1 day, 19 hours ago |
arxiv.org
Fingerprint Matching with Localized Deep Representation
1 day, 19 hours ago |
arxiv.org
Jobs in AI, ML, Big Data
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne