all AI news
Topic: evals
Mixtral 8x22B MoE - The New Best Open LLM? Fully-Tested
1 week, 6 days ago |
www.youtube.com
A Builder's Guide to Evals for LLM-based Applications
3 weeks, 3 days ago |
eugeneyan.com
Why You Should Not Use Numeric Evals For LLM As a Judge
1 month, 2 weeks ago |
towardsdatascience.com
[P]Retri-evals: Retrieval Evaluation Pipelines
3 months, 2 weeks ago |
www.reddit.com
Big Tech's LLM evals are just marketing
4 months, 1 week ago |
www.interconnects.ai
Openlayer: LLM Evals and Monitoring
4 months, 2 weeks ago |
www.producthunt.com
LLM Evals: Setup and the Metrics That Matter
6 months, 1 week ago |
towardsdatascience.com
Building the Foundation Model Ops Platform — with Raza Habib of Humanloop
6 months, 3 weeks ago |
www.latent.space
Day 14: Open NLLB - exploring BLEU, chrF++, logging (Pt 3. cont. 2)
7 months, 2 weeks ago |
www.youtube.com
Day 14: Open NLLB - exploring BLEU, chrF++, logging (Pt 3. cont.)
7 months, 2 weeks ago |
www.youtube.com
Day 14: Open NLLB - exploring BLEU, chrF++, logging (Pt 3.)
7 months, 2 weeks ago |
www.youtube.com
Day 14: Open NLLB - Eval of our first run (English, Turkish, Hindi) (Pt 2.)
7 months, 2 weeks ago |
www.youtube.com
Design Patterns for LLM Systems & Products
8 months, 3 weeks ago |
eugeneyan.com
7 Open Source Models From OpenAI
11 months, 3 weeks ago |
analyticsindiamag.com
Items published with this topic over the last 90 days.
Latest
Mixtral 8x22B MoE - The New Best Open LLM? Fully-Tested
1 week, 6 days ago |
www.youtube.com
A Builder's Guide to Evals for LLM-based Applications
3 weeks, 3 days ago |
eugeneyan.com
Why You Should Not Use Numeric Evals For LLM As a Judge
1 month, 2 weeks ago |
towardsdatascience.com
[P]Retri-evals: Retrieval Evaluation Pipelines
3 months, 2 weeks ago |
www.reddit.com
Big Tech's LLM evals are just marketing
4 months, 1 week ago |
www.interconnects.ai
Openlayer: LLM Evals and Monitoring
4 months, 2 weeks ago |
www.producthunt.com
LLM Evals: Setup and the Metrics That Matter
6 months, 1 week ago |
towardsdatascience.com
Building the Foundation Model Ops Platform — with Raza Habib of Humanloop
6 months, 3 weeks ago |
www.latent.space
Day 14: Open NLLB - exploring BLEU, chrF++, logging (Pt 3. cont. 2)
7 months, 2 weeks ago |
www.youtube.com
Day 14: Open NLLB - exploring BLEU, chrF++, logging (Pt 3. cont.)
7 months, 2 weeks ago |
www.youtube.com
Day 14: Open NLLB - exploring BLEU, chrF++, logging (Pt 3.)
7 months, 2 weeks ago |
www.youtube.com
Day 14: Open NLLB - Eval of our first run (English, Turkish, Hindi) (Pt 2.)
7 months, 2 weeks ago |
www.youtube.com
Design Patterns for LLM Systems & Products
8 months, 3 weeks ago |
eugeneyan.com
7 Open Source Models From OpenAI
11 months, 3 weeks ago |
analyticsindiamag.com
Topic trend (last 90 days)
Top (last 7 days)
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Data Science Specialist
@ Telstra | Telstra ICC Bengaluru
Senior Staff Engineer, Machine Learning
@ Nagarro | Remote, India