Jan. 29, 2024, 12:10 p.m. | /u/Lathanderrr

Machine Learning www.reddit.com

Hello guys,

I need some advice, assume that you are building a RAG. You want your context chunks to be 512 token long. How to divide a solid 1000+ paragraph without loosing semantic connection.

For more information,
Its an question answering bot, that huge paragraph is answer to one of a frequently asked question.

advice bot building context hello information machinelearning question question answering rag semantic solid token

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York