Benchmarking LLM Performance with LangChain Auto-Evaluator // Lance Martin // LLMs in Prod Con Part 2
July 31, 2023, 5:54 p.m. | MLOps.community (www.youtube.com)
Document question-answering is a popular LLM use case. LangChain makes it easy to assemble LLM components (e.g., models and retrievers) into chains that support question-answering. But it is not always obvious how to (1) evaluate answer quality and (2) use that evaluation to guide improved QA chain settings (e.g., chunk size, retrieved docs count) or components (e.g., model or retriever choice). We recently released an open-source, hosted app to address these limitations (see blog post here). We have …
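The evaluation-guided tuning idea above can be sketched as a small grid search over chain settings. This is a hypothetical illustration, not the Auto-Evaluator's actual API: `grade` uses simple token overlap as a stand-in for the app's LLM-based grading, and `build_chain` stands in for constructing a LangChain QA chain with the given chunk size and retrieval count.

```python
from itertools import product

def grade(predicted: str, reference: str) -> float:
    """Stand-in grader: fraction of reference-answer tokens found in the
    predicted answer. The real app would use an LLM grader instead."""
    ref_tokens = reference.lower().split()
    pred_tokens = set(predicted.lower().split())
    if not ref_tokens:
        return 0.0
    return sum(t in pred_tokens for t in ref_tokens) / len(ref_tokens)

def evaluate_config(qa_fn, eval_set):
    """Average grade of one QA configuration over (question, reference) pairs."""
    return sum(grade(qa_fn(q), ref) for q, ref in eval_set) / len(eval_set)

def sweep(build_chain, eval_set, chunk_sizes, k_values):
    """Try each (chunk_size, k) setting and return the best-scoring one.
    build_chain(chunk_size, k) is assumed to return a question -> answer callable."""
    best = None
    for chunk_size, k in product(chunk_sizes, k_values):
        score = evaluate_config(build_chain(chunk_size, k), eval_set)
        if best is None or score > best[0]:
            best = (score, chunk_size, k)
    return best
```

In practice `build_chain` would rebuild the retriever with the new chunking and pass `k` to retrieval, and the best-scoring settings from the sweep would be adopted for the production chain.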