experimental-phi3-webgpu

May 9, 2024, 10:21 p.m. |

Simon Willison's Weblog simonwillison.net

Run Microsoft’s excellent Phi-3 model directly in your browser, using WebGPU so didn’t work in Firefox for me, just in Chrome.

It fetches around 2.1GB of data into the browser cache on first run, but then gave me decent quality responses to my prompts running at an impressive 21 tokens a second (M2, 64GB).

I think Phi-3 is the highest quality model of this size, so it’s a really good fit for running in a browser like this.

Via …

ai browser browsers cache chrome data experimental firefox generativeai homebrewllms llms microsoft phi phi-3 prompts quality responses running the browser think tokens webassembly webgpu work

Visit resource

More from simonwillison.net / Simon Willison's Weblog

How (some) good corporate engineering blogs are written 16 hours ago | simonwillison.net

blogging blogs cloudflare companies +14

Stealing everything you’ve ever typed or viewed on your own Windows PC is now possible … 16 hours ago | simonwillison.net

code copilot disaster ever +12

Quoting Will Larson 1 day, 4 hours ago | simonwillison.net

art ceo companies cost +10

Man caught in scam after AI told him fake Facebook customer support number was legitimate 1 day, 7 hours ago | simonwillison.net

ai case chatbot customer +13

Django Enhancement Proposal 14: Background Workers 1 day, 16 hours ago | simonwillison.net

django ecosystem frameworks howard +12

Why, after 6 years, I’m over GraphQL 2 days, 14 hours ago | simonwillison.net

all in authorization complexity graphql +3

What does the public in six countries think of generative AI in news? 2 days, 17 hours ago | simonwillison.net

ai chatgpt evidence generative +15

Quoting Andrej Karpathy 2 days, 17 hours ago | simonwillison.net

ai algorithms andrej karpathy andrejkarpathy +14

Codestral: Hello, World! 2 days, 17 hours ago | simonwillison.net

ai code codestral derivatives +17

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

View on ai-jobs.net

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

@ HelloBetter | Remote

View on ai-jobs.net

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

View on ai-jobs.net

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

View on ai-jobs.net

Cloud Data Platform Engineer

@ First Central | Home Office (Remote)

View on ai-jobs.net

Associate Director, Data Science

@ MSD | USA - New Jersey - Rahway

View on ai-jobs.net

all AI news

experimental-phi3-webgpu

More from simonwillison.net / Simon Willison's Weblog

Jobs in AI, ML, Big Data

Senior Machine Learning Engineer

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

Seeking Developers and Engineers for AI T-Shirt Generator Project

Cloud Data Platform Engineer

Associate Director, Data Science