s
April 9, 2024, 11:03 p.m. |

Simon Willison's Weblog simonwillison.net

Extracting data from unstructured text and images with Datasette and GPT-4 Turbo


Datasette Extract is a new Datasette plugin that uses GPT-4 Turbo (released to general availability today) and GPT-4 Vision to extract structured data from unstructured text and images.


I put together a video demo of the plugin in action today, and posted it to the Datasette Cloud blog along with screenshots and a tutorial describing how to use it.

ai availability data datasette datasettecloud demo extract general general availability generativeai gpt gpt-4 gpt4 gpt-4 vision images llms openai plugin projects structured data text together turbo unstructured video vision

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

#13721 - Data Engineer - AI Model Testing

@ Qualitest | Miami, Florida, United States

Elasticsearch Administrator

@ ManTech | 201BF - Customer Site, Chantilly, VA