April 16, 2024, 1 p.m. | Anthony Alford

InfoQ - AI, ML & Data Engineering www.infoq.com

Google Research recently developed ScreenAI, a multimodal AI model for understanding infographics and user interfaces. ScreenAI is based on the PaLI architecture and achieves state-of-the-art performance on several tasks.

By Anthony Alford

ai ai model anthony architecture art computer vision deep learning google google research infographics interfaces ml & data engineering multimodal multimodal ai neural networks performance research state tasks trains understanding

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York