Benchmarking Agent Tool Use
Dec. 20, 2023, 6:28 a.m. | LangChain
LangChain blog.langchain.dev
Agents may be the “killer” LLM app, but building and evaluating agents is hard. Function calling is a key skill for effective tool use, but there aren’t many good benchmarks for measuring function calling performance. Today, we are excited to release four new test environments for