Aug. 27, 2023, 5:55 p.m. | /u/CShorten

Deep Learning www.reddit.com

Hey everyone, we have a new tutorial on the Weaviate YouTube channel going over 3 commonly used text chunking strategies: 1. Hard Splits, 2. Rolling Windows, and 3. Parsing by Headers!

I think Parsing by Headers is the most exciting strategy by far, especially because of its ability to preserve semantic regions of a document. For example, if you look at this Reddit page, it would make sense to separate the text in this post from the "Posting to Reddit" rules in …
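For anyone who wants to see roughly what the three strategies look like in code, here is a minimal sketch. It is not taken from the tutorial: the function names `hard_split`, `rolling_window`, and `split_by_headers` are hypothetical, chunk sizes are counted in whitespace-separated words (token counts are another common choice), and headers are assumed to be Markdown-style `#` lines.

```python
import re


def hard_split(text, size):
    """Hard split: consecutive, non-overlapping chunks of at most `size` words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def rolling_window(text, size, stride):
    """Rolling window: chunks of `size` words that advance `stride` words at a time.

    With stride < size, neighbouring chunks overlap by (size - stride) words.
    """
    words = text.split()
    chunks = []
    for start in range(0, len(words), stride):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # the last window already reached the end of the text
    return chunks


def split_by_headers(markdown):
    """Header parsing: group lines under the most recent '#'-style header.

    Returns (title, body) pairs; text before the first header gets title None.
    """
    sections = []
    title, body = None, []
    for line in markdown.splitlines():
        if re.match(r"#{1,6}\s", line):
            if title is not None or body:
                sections.append((title, "\n".join(body).strip()))
            title, body = line.lstrip("#").strip(), []
        else:
            body.append(line)
    if title is not None or body:
        sections.append((title, "\n".join(body).strip()))
    return sections


if __name__ == "__main__":
    page = "# Post\nNew chunking tutorial is up.\n\n# Posting rules\nBe kind."
    for section_title, section_body in split_by_headers(page):
        print(section_title, "->", section_body)
```

The header version is the only one of the three that keeps the post body and the rules in separate chunks regardless of their length; the other two cut purely by position.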

