April 2, 2024, 7:42 p.m. | Sumit Soman, Sujoy Roychowdhury

cs.LG updates on arXiv.org

arXiv:2404.00657v1 Announce Type: new
Abstract: Retrieval-augmented generation (RAG) for technical documents poses challenges because embeddings often fail to capture domain information. We review prior art on the important factors affecting RAG and perform experiments to highlight best practices and potential challenges in building RAG systems for technical documents.
