all AI news
[R] LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error - Microsoft Semantic Machines 2024 - Giving Mistral-Instruct-7B a boost of 46,7% points and enabling it to outperform GPT-4 in the ToolBench benchmark!
March 8, 2024, 2:39 p.m. | /u/Singularian2501
Machine Learning www.reddit.com
Github: [https://github.com/microsoft/simulated-trial-and-error](https://github.com/microsoft/simulated-trial-and-error)
Abstract:
>Tools are essential for large language models (LLMs) to acquire up-to-date information and take consequential actions in external environments. Existing work on tool-augmented LLMs primarily focuses on the broad coverage of tools and the flexibility of adding new tools. However, a critical aspect that has surprisingly been understudied is simply how accurately an LLM uses tools for which it has been trained. We find that existing LLMs, including GPT-4 and open-source LLMs specifically fine-tuned for …
abstract augmented llms coverage environments flexibility however information language language models large language large language models llm llms machinelearning tool tools work
More from www.reddit.com / Machine Learning
Jobs in AI, ML, Big Data
Software Engineer for AI Training Data (School Specific)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Python)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Tier 2)
@ G2i Inc | Remote
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US