April 28, 2024, 3:34 p.m. | /u/Aggravating-Floor-38

Natural Language Processing www.reddit.com

Hey guys, need advice on techniques that really elevate rag from naive to an advanced system. I've built a rag system that scrapes data from the internet and uses that as context. I've worked a bit on chunking strategy and worked extensively on cleaning strategy for the scraped data, query expansion and rewriting, but haven't done much else. I don't think I can work on the metadata extraction aspect because I'm using local llms and using them for summaries and …

advanced advice cleaning context data expansion hey internet languagetechnology query rag strategy

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US