experimental-phi3-webgpu

May 9, 2024, 10:21 p.m. |

Simon Willison's Weblog simonwillison.net

Run Microsoft’s excellent Phi-3 model directly in your browser, using WebGPU so didn’t work in Firefox for me, just in Chrome.

It fetches around 2.1GB of data into the browser cache on first run, but then gave me decent quality responses to my prompts running at an impressive 21 tokens a second (M2, 64GB).

I think Phi-3 is the highest quality model of this size, so it’s a really good fit for running in a browser like this.

Via …

ai browser browsers cache chrome data experimental firefox generativeai homebrewllms llms microsoft phi phi-3 prompts quality responses running the browser think tokens webassembly webgpu work

Visit resource

More from simonwillison.net / Simon Willison's Weblog

Spam, junk … slop? The latest wave of AI behind the ‘zombie internet’ 21 hours ago | simonwillison.net

ai ethics generativeai internet +7

NumFOCUS DISCOVER Cookbook: Minimal Measures 22 hours ago | simonwillison.net

accessibility collection conferences diversity +8

Fast groq-hosted LLMs vs browser jank 1 day, 3 hours ago | simonwillison.net

browser browsers callback every +12

A Plea for Sober AI 1 day, 16 hours ago | simonwillison.net

ai drewbreunig generativeai good +6

AI counter app from my PyCon US keynote 2 days, 1 hour ago | simonwillison.net

ai app artificial artificial intelligence +11

Quoting Patrick Reynolds 2 days, 15 hours ago | simonwillison.net

building change codebase data +2

Understand errors and warnings better with Gemini 2 days, 19 hours ago | simonwillison.net

ai applications chrome chrome devtools +23

Commit: Add a shared credentials relationship from twitter.com to x.com 2 days, 21 hours ago | simonwillison.net

apple json manager password +5

Quoting Kelsey Piper 2 days, 22 hours ago | simonwillison.net

agreement ai document employee +6

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

all AI news

experimental-phi3-webgpu

More from simonwillison.net / Simon Willison's Weblog

Jobs in AI, ML, Big Data

Software Engineer for AI Training Data (School Specific)

Software Engineer for AI Training Data (Python)

Software Engineer for AI Training Data (Tier 2)

Data Engineer

Artificial Intelligence – Bioinformatic Expert

Lead Developer (AI)