Nov. 14, 2023, 11:56 p.m. | Mike Young

Replicate Codex notes.replicatecodex.com

It's still early, but a GPT-4V agent can navigate smartphone GUIs using a combination of image processing and text-based reasoning.

agent amazon app combination gpt gpt-4v image image processing iphone plain english papers processing reasoning researchers smartphone text

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne