April 28, 2024, noon | code_your_own_AI

code_your_own_AI www.youtube.com

WebArena provides a test environment for evaluating AI agents on the functional correctness of their task completions. It is well suited to the development and performance testing of autonomous AI agents.
All rights with the authors: https://arxiv.org/pdf/2307.13854

Plus: research by Microsoft on using the user's intent to optimize tool use by autonomous agents.

Plus: the open-source Toolkit from @CohereAI, now available for building your RAG system with pre-built components and apps.
https://github.com/cohere-ai/cohere-toolkit

00:00 Autonomous AI Agents
00:45 WebArena for dev of agents
03:30 Microsoft's GeckOpt multi-tool use
08:25 …

