s
April 9, 2024, 11:03 p.m. |

Simon Willison's Weblog simonwillison.net

Extracting data from unstructured text and images with Datasette and GPT-4 Turbo


Datasette Extract is a new Datasette plugin that uses GPT-4 Turbo (released to general availability today) and GPT-4 Vision to extract structured data from unstructured text and images.


I put together a video demo of the plugin in action today, and posted it to the Datasette Cloud blog along with screenshots and a tutorial describing how to use it.

ai availability data datasette datasettecloud demo extract general general availability generativeai gpt gpt-4 gpt4 gpt-4 vision images llms openai plugin projects structured data text together turbo unstructured video vision

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York