all AI news
Your Co-Workers Matter: Evaluating Collaborative Capabilities of Language Models in Blocks World
April 2, 2024, 7:51 p.m. | Guande Wu, Chen Zhao, Claudio Silva, He He
cs.CL updates on arXiv.org arxiv.org
Abstract: Language agents that interact with the world on their own have great potential for automating digital tasks. While large language model (LLM) agents have made progress in understanding and executing tasks such as textual games and webpage control, many real-world tasks also require collaboration with humans or other LLMs in equal roles, which involves intent understanding, task coordination, and communication. To test LLM's ability to collaborate, we design a blocks-world environment, where two agents, each …
abstract agents arxiv capabilities collaborative control cs.ai cs.cl cs.hc digital games language language model language models large language large language model llm matter progress tasks textual type understanding workers world
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne