Benchmarking Agent Tool Use
Dec. 20, 2023, 6:28 a.m. | LangChain
LangChain blog.langchain.dev
Agents may be the “killer” LLM app, but building and evaluating agents is hard. Function calling is a key skill for effective tool use, but there aren’t many good benchmarks for measuring function calling performance. Today, we are excited to release four new test environments for