LangChain Data Loaders, Tokenizers, Chunking, and Datasets - Data Prep 101

March 23, 2023, 2:53 p.m. | James Briggs

James Briggs www.youtube.com

In this video, we're going to focus on preparing our text using LangChain data loaders, tokenization using the tiktoken tokenizers, chunking with LangChain text splitters, and storing data with Hugging Face datasets. Naturally, the focus here is on OpenAI embedding and completion models, but we can apply the same logic to other LLMs like those available via Hugging Face, Cohere, and so on.
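As a rough illustration of the chunking step described above, here is a minimal, dependency-free sketch of length-aware chunking with overlap. It is not the notebook's code and not LangChain's splitter: `chunk_text` and `whitespace_token_len` are hypothetical names, and the whitespace word count is only a stand-in for a real token count such as `len(tiktoken.get_encoding("cl100k_base").encode(text))`.

```python
from typing import Callable, List


def whitespace_token_len(text: str) -> int:
    """Stand-in length function: counts whitespace-separated words.

    Assumption: a real pipeline would count tokens with a tokenizer
    (for example tiktoken) instead of counting words.
    """
    return len(text.split())


def chunk_text(
    text: str,
    chunk_size: int,
    chunk_overlap: int = 0,
    length_function: Callable[[str], int] = whitespace_token_len,
) -> List[str]:
    """Greedily pack words into chunks no longer than chunk_size.

    chunk_size is measured by length_function. For simplicity,
    chunk_overlap is measured in words (hypothetical simplification),
    so it only matches token counts when length_function counts words.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be in [0, chunk_size)")

    chunks: List[str] = []
    current: List[str] = []
    for word in text.split():
        candidate = current + [word]
        if current and length_function(" ".join(candidate)) > chunk_size:
            # Close the current chunk, then start the next one with the
            # last chunk_overlap words so neighbouring chunks share context.
            chunks.append(" ".join(current))
            tail = current[-chunk_overlap:] if chunk_overlap else []
            current = tail + [word]
        else:
            current = candidate
    if current:
        chunks.append(" ".join(current))
    return chunks


if __name__ == "__main__":
    sample = "one two three four five six seven eight nine ten"
    for i, chunk in enumerate(chunk_text(sample, chunk_size=4, chunk_overlap=1)):
        print(i, chunk)
```

Passing the length function in as a parameter keeps the chunk limit tied to whatever model will consume the text, so the same loop can serve OpenAI models or other LLMs by swapping the tokenizer. The resulting chunks could then be stored as records (for example a list of `{"id": ..., "text": ...}` dictionaries) before being handed to a dataset library.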

🔗 Notebook link:
https://github.com/pinecone-io/examples/blob/master/generation/langchain/handbook/xx-langchain-chunking.ipynb

🎙️ Support me on Patreon:
https://patreon.com/JamesBriggs

🎨 AI Art:
https://www.etsy.com/uk/shop/IntelligentArtEU

🤖 70% Discount …
