April 15, 2024, 11:52 a.m. | Andrej Baranovskij

Andrej Baranovskij www.youtube.com

The unstructured library is used to pre-process PDF document content into a cleaner format, which helps the LLM produce a more accurate response. The JSON response is generated by the Nous Hermes 2 Pro LLM, without any additional post-processing. A dynamic Pydantic class validates the response to make sure it matches the request.
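The validation step can be illustrated with a minimal sketch. It is not Sparrow's actual code: the helper names `build_model` and `validate_response`, the model name `DynamicResponse`, and the invoice fields are all hypothetical, chosen only to show how a Pydantic class can be built at runtime from the requested fields and then used to check the LLM's JSON output.

```python
import json

from pydantic import ValidationError, create_model


def build_model(field_types: dict):
    """Create a Pydantic model class at runtime from {field_name: type}.

    Hypothetical helper for illustration; not taken from the Sparrow repo.
    """
    # A (type, ...) tuple marks each field as required.
    fields = {name: (tp, ...) for name, tp in field_types.items()}
    return create_model("DynamicResponse", **fields)


def validate_response(raw_json: str, field_types: dict):
    """Return the validated fields as a dict, or None if the response
    is not valid JSON or does not match the requested fields."""
    model_cls = build_model(field_types)
    try:
        data = json.loads(raw_json)
        obj = model_cls(**data)
    except (json.JSONDecodeError, ValidationError, TypeError):
        return None
    return {name: getattr(obj, name) for name in field_types}


# Assumed example request: the caller asks for two fields.
requested = {"invoice_number": str, "total": float}

# A response that matches the request passes validation.
print(validate_response('{"invoice_number": "INV-001", "total": 125.5}', requested))

# A response with a missing field is rejected.
print(validate_response('{"invoice_number": "INV-001"}', requested))
```

Building the class from the request means the same validation code works for any set of requested fields, with no hand-written schema per document type.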

Sparrow GitHub repo:
https://github.com/katanaml/sparrow

0:00 Intro
0:44 Example
2:35 Code - Requirements
3:06 Code - Config
4:26 Code - HTML text
5:10 Code - Agent
8:03 Summary


