Nov. 27, 2023, 12:42 p.m. | Andrej Baranovskij


I explain the implementation of a pipeline that processes invoice data from PDF documents. The data is loaded into a Chroma vector store; through the LangChain API, it can then be consumed by an LLM as part of the RAG infrastructure.

GitHub repo:
https://github.com/katanaml/llm-ollama-invoice-cpu

0:00 Intro
1:19 Libs
1:54 Ingest data with ChromaDB
6:17 Main script
6:59 Pipeline with LangChain
9:00 Testing and Summary
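Under the hood, the retrieval step of a RAG pipeline reduces to nearest-neighbour search over embedding vectors, which is what Chroma performs for each query. A dependency-free sketch with toy three-dimensional vectors (the texts and vectors are made up for illustration):

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)


def top_k(query_vec, store, k=2):
    """store: list of (text, vector) pairs; return the k texts nearest to query_vec."""
    ranked = sorted(store, key=lambda item: cosine(query_vec, item[1]), reverse=True)
    return [text for text, _ in ranked[:k]]


# Toy "vector store": invoice chunks with made-up embeddings.
store = [
    ("invoice total: 1200 EUR", [0.9, 0.1, 0.0]),
    ("due date: 2023-12-15",    [0.1, 0.9, 0.0]),
    ("vendor: ACME GmbH",       [0.0, 0.2, 0.9]),
]

# A query embedding close to the "total" chunk retrieves that chunk first.
print(top_k([1.0, 0.0, 0.0], store, k=1))  # → ['invoice total: 1200 EUR']
```

A real store uses high-dimensional embeddings from a model and an approximate index, but the ranking idea is the same.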

CONNECT:
- Subscribe to this YouTube channel
- Twitter: https://twitter.com/andrejusb
- LinkedIn: …

