all AI news
Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control
Feb. 28, 2024, 5:47 a.m. | Thong Nguyen, Mariya Hendriksen, Andrew Yates, Maarten de Rijke
cs.CV updates on arXiv.org arxiv.org
Abstract: Learned sparse retrieval (LSR) is a family of neural methods that encode queries and documents into sparse lexical vectors that can be indexed and retrieved efficiently with an inverted index. We explore the application of LSR to the multi-modal domain, with a focus on text-image retrieval. While LSR has seen success in text retrieval, its application in multimodal retrieval remains underexplored. Current approaches like LexLIP and STAIR require complex multi-step training on massive datasets. Our …
abstract application arxiv control cs.cv cs.ir documents domain encode expansion explore family focus image index modal multi-modal multimodal queries retrieval text text-image text-image retrieval type vectors
More from arxiv.org / cs.CV updates on arXiv.org
Jobs in AI, ML, Big Data
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US