s
Sept. 4, 2023, 9:13 p.m. |

Simon Willison's Weblog simonwillison.net

Wikipedia search-by-vibes through millions of pages offline


Really cool demo by Lee Butterman, who built embeddings of 2 million Wikipedia pages and figured out how to serve them directly to the browser, where they are used to implement "vibes based" similarity search returning results in 250ms. Lots of interesting details about how he pulled this off, using Arrow as the file format and ONNX to run the model in the browser.


Via @leebutterman

browser demo embedding embeddings offline search serve the browser them through vibes webassembly wikipedia

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer

@ Samsara | Canada - Remote

Machine Learning & Data Engineer - Consultant

@ Arcadis | Bengaluru, Karnataka, India