all AI news
How CPython Implements and Uses Bloom Filters for String Processing
Sept. 16, 2023, 10:32 p.m. |
Simon Willison's Weblog simonwillison.net
How CPython Implements and Uses Bloom Filters for String Processing
Fascinating dive into Python string internals by Abhinav Upadhyay. It turns out CPython uses very simple bloom filters in several parts of the core string methods, to solve problems like splitting on newlines where there are actually eight codepoints that could represent a newline, and a tiny bloom filter can help filter a character in a single operation before performing all eight comparisons only if that first check failed.
bloom core filters performance processing python simple solve string
More from simonwillison.net / Simon Willison's Weblog
AI counter app from my PyCon US keynote
1 day, 20 hours ago |
simonwillison.net
Understand errors and warnings better with Gemini
2 days, 14 hours ago |
simonwillison.net
Commit: Add a shared credentials relationship from twitter.com to x.com
2 days, 16 hours ago |
simonwillison.net
Jobs in AI, ML, Big Data
Software Engineer for AI Training Data (School Specific)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Python)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Tier 2)
@ G2i Inc | Remote
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US