all AI news
Your Co-Workers Matter: Evaluating Collaborative Capabilities of Language Models in Blocks World
April 2, 2024, 7:51 p.m. | Guande Wu, Chen Zhao, Claudio Silva, He He
cs.CL updates on arXiv.org arxiv.org
Abstract: Language agents that interact with the world on their own have great potential for automating digital tasks. While large language model (LLM) agents have made progress in understanding and executing tasks such as textual games and webpage control, many real-world tasks also require collaboration with humans or other LLMs in equal roles, which involves intent understanding, task coordination, and communication. To test LLM's ability to collaborate, we design a blocks-world environment, where two agents, each …
abstract agents arxiv capabilities collaborative control cs.ai cs.cl cs.hc digital games language language model language models large language large language model llm matter progress tasks textual type understanding workers world
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York