LlamaIndex Workshop: Evaluation-Driven Development (EDD)
Oct. 23, 2023, 12:10 a.m. | LlamaIndex | www.youtube.com
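This workshop covers Evaluation-Driven Development (EDD), which consists of the following steps:
1. Defining evaluation metrics (performance metrics such as faithfulness and relevancy, or system metrics such as latency and cost)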
2. Creating an evaluation dataset
3. Defining a baseline
4. Trying out different approaches
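The four steps above can be sketched as a small evaluation loop. This is a minimal illustration in plain Python, not the workshop's code and not the LlamaIndex API: the `token_overlap` metric is a crude stand-in for a real relevancy or faithfulness evaluator, and the dataset, the `baseline` and `candidate` pipelines, and all other names are hypothetical.

```python
import time
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Example:
    question: str
    reference: str  # expected answer for this question


# Step 1: define metrics. Hypothetical quality metric: the fraction of
# reference tokens that also appear in the answer. Latency is measured below.
def token_overlap(answer: str, reference: str) -> float:
    ref = set(reference.lower().split())
    ans = set(answer.lower().split())
    return len(ref & ans) / len(ref) if ref else 0.0


# Step 2: create an evaluation dataset (toy data, assumed for illustration).
DATASET: List[Example] = [
    Example("What does EDD stand for?", "evaluation driven development"),
    Example("What is a system metric?", "latency or cost"),
]

Pipeline = Callable[[str], str]


def evaluate(pipeline: Pipeline, dataset: List[Example]) -> Dict[str, float]:
    """Run a pipeline over the dataset; return mean quality and mean latency."""
    scores = []
    start = time.perf_counter()
    for ex in dataset:
        scores.append(token_overlap(pipeline(ex.question), ex.reference))
    elapsed = time.perf_counter() - start
    return {
        "quality": sum(scores) / len(scores),
        "latency_s": elapsed / len(dataset),
    }


# Step 3: define a baseline (a deliberately weak, fixed-answer pipeline).
def baseline(question: str) -> str:
    return "evaluation"


# Step 4: try a different approach (here, a lookup table standing in for
# an improved pipeline) and score it with the same metrics and dataset.
def candidate(question: str) -> str:
    answers = {ex.question: ex.reference for ex in DATASET}
    return answers.get(question, "")


def run_comparison() -> Dict[str, Dict[str, float]]:
    return {
        "baseline": evaluate(baseline, DATASET),
        "candidate": evaluate(candidate, DATASET),
    }


if __name__ == "__main__":
    for name, result in run_comparison().items():
        print(name, result)
```

Keeping the metrics and dataset fixed while swapping only the pipeline is what makes the comparison against the baseline meaningful. In a real setup the stand-in metric would be replaced by proper evaluators and the toy pipelines by actual query engines.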
We're excited to feature Wenqi Glantz, an open-source evangelist who has written a series of wonderful blog posts on this topic:
https://levelup.gitconnected.com/evaluation-driven-development-the-swiss-army-knife-for-rag-pipelines-dba24218d47e
https://levelup.gitconnected.com/exploring-zephyr-7b-alpha-through-the-lens-of-evaluation-driven-development-faf69e9d9ec7