Jan. 29, 2024, 12:10 p.m. | /u/Lathanderrr

Machine Learning www.reddit.com

Hello guys,

I need some advice, assume that you are building a RAG. You want your context chunks to be 512 token long. How to divide a solid 1000+ paragraph without loosing semantic connection.

For more information,
Its an question answering bot, that huge paragraph is answer to one of a frequently asked question.

advice bot building context hello information machinelearning question question answering rag semantic solid token

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Software Engineer, Generative AI (C++)

@ SoundHound Inc. | Toronto, Canada