Oct. 22, 2023, 1:49 p.m. | Andrej Baranovskij

Andrej Baranovskij www.youtube.com

I explained how to set up local LLM RAG to process invoice data with Llama2 13B. Based on my experiments, Llama2 13B works better with tabular data compared to Mistral 7B model. This example presents a production LLM RAG setup with Weaviate database for vector embeddings, Haystack for LLM API, and Llama.cpp to run Llama2 13b on a local CPU.

GitHub repo:
https://github.com/katanaml/llm-rag-invoice-cpu/

0:00 Intro
1:10 Examples
6:35 Setup
8:20 Config
9:45 Weaviate Docker
10:05 Data Ingest Code
10:30 Inference …

13b cpp cpu data database data processing embeddings example explained haystack invoice llama llama2 llm llm rag mistral mistral 7b process processing production rag set setup tabular tabular data vector vector embeddings weaviate

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Business Data Scientist, gTech Ads

@ Google | Mexico City, CDMX, Mexico

Lead, Data Analytics Operations

@ Zocdoc | Pune, Maharashtra, India