Lessons after a half-billion GPT tokens

April 13, 2024, 8:54 p.m. |

Simon Willison's Weblog simonwillison.net

Ken Kantzer presents some hard-won experience from shipping real features on top of OpenAI's models.

They ended up settling on a very basic abstraction over the chat API - mainly to handle automatic retries on a 500 error. No complex wrappers, not even JSON mode or function calling or system prompts.

Rather than counting tokens they estimate tokens as 3 times the length in characters, which works well enough.

One challenge they highlight for …

abstraction ai api basic billion chat error experience features function generativeai gpt json llms openai promptengineering shipping tokens

Visit resource

More from simonwillison.net / Simon Willison's Weblog

GPUs Go Brrr an hour ago | simonwillison.net

ai figure flat gpus +12

Parsing PNG images in Mojo 9 hours ago | simonwillison.net

building chris chris lattner code +13

About ARDC (Amateur Radio Digital Communications) 12 hours ago | simonwillison.net

advance block communication communications +8

“Link In Bio” is a slow knife 15 hours ago | simonwillison.net

anildash bio dash instagram +7

Ham radio general exam question pool as JSON 1 day, 10 hours ago | simonwillison.net

data datasette exam general +12

Exploring Hacker News by mapping and analyzing 40 million posts and comments for fun 2 days, 13 hours ago | simonwillison.net

api data data engineering embeddings +11

uv pip install --exclude-newer example 2 days, 13 hours ago | simonwillison.net

command example feature git +6

Bullying in Open Source Software Is a Massive Security Vulnerability 3 days, 7 hours ago | simonwillison.net

backdoor contributor linux linux distributions +13

experimental-phi3-webgpu 3 days, 7 hours ago | simonwillison.net

ai browser browsers cache +20

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

View more jobs

all AI news

Lessons after a half-billion GPT tokens

More from simonwillison.net / Simon Willison's Weblog

Jobs in AI, ML, Big Data

Data Engineer

Artificial Intelligence – Bioinformatic Expert

Lead Developer (AI)

Research Engineer

Ecosystem Manager

Founding AI Engineer, Agents