all AI news
Efficient Document Ranking with Learnable Late Interactions
June 27, 2024, 4:45 a.m. | Ziwei Ji, Himanshu Jain, Andreas Veit, Sashank J. Reddi, Sadeep Jayasumana, Ankit Singh Rawat, Aditya Krishna Menon, Felix Yu, Sanjiv Kumar
cs.LG updates on arXiv.org arxiv.org
Abstract: Cross-Encoder (CE) and Dual-Encoder (DE) models are two fundamental approaches for query-document relevance in information retrieval. To predict relevance, CE models use joint query-document embeddings, while DE models maintain factorized query and document embeddings; usually, the former has higher quality while the latter benefits from lower latency. Recently, late-interaction models have been proposed to realize more favorable latency-quality tradeoffs, by using a DE structure followed by a lightweight scorer based on query and document token …
abstract arxiv benefits cross-encoder cs.ai cs.ir cs.lg document document embeddings embeddings encoder fundamental information interactions latency quality query ranking retrieval stat.ml type while
More from arxiv.org / cs.LG updates on arXiv.org
MixerFlow: MLP-Mixer meets Normalising Flows
2 days, 3 hours ago |
arxiv.org
Kernelised Normalising Flows
2 days, 3 hours ago |
arxiv.org
Jobs in AI, ML, Big Data
Software Engineer II –Decision Intelligence Delivery and Support
@ Bristol Myers Squibb | Hyderabad
Senior Data Governance Consultant (Remote in US)
@ Resultant | Indianapolis, IN, United States
Power BI Developer
@ Brompton Bicycle | Greenford, England, United Kingdom
VP, Enterprise Applications
@ Blue Yonder | Scottsdale
Data Scientist - Moloco Commerce Media
@ Moloco | Redwood City, California, United States
Senior Backend Engineer (New York)
@ Kalepa | New York City. Hybrid