all AI news
[R] Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study - Beijing Academy of Artificial Intelligence (BAAI) 2024 - First Agent able to follow and finish real missions in a AAA game!
March 8, 2024, 9:02 a.m. | /u/Singularian2501
Machine Learning www.reddit.com
Projekt Website with code and videos: [https://baai-agents.github.io/Cradle/](https://baai-agents.github.io/Cradle/)
Abstract:
>Despite the success in specific tasks and scenarios, existing foundation agents, empowered by large models (LMs) and advanced tools, still cannot generalize to different scenarios, mainly due to dramatic differences in the observations and actions across scenarios. In this work, we propose the General Computer Control (GCC) setting: building foundation agents that can master any computer task by taking only screen images (and possibly audio) of the computer as input, …
abstract advanced agents building computer control differences foundation gcc general large models lms machinelearning master specific tasks success tasks tools work
More from www.reddit.com / Machine Learning
Jobs in AI, ML, Big Data
Senior Machine Learning Engineer
@ GPTZero | Toronto, Canada
ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)
@ HelloBetter | Remote
Werkstudent Data Architecture & Governance (w/m/d)
@ E.ON | Essen, DE
Data Architect, Data Lake, Professional Services
@ Amazon.com | Bogota, DC, COL
Data Architect, Data Lake, Professional Services
@ Amazon.com | Buenos Aires City, Buenos Aires Autonomous City, ARG
Data Architect
@ Bitful | United States - Remote