[P] Making my bookshelves clickable with computer vision
Feb. 14, 2024, 10:21 a.m. | /u/zerojames_
Machine Learning www.reddit.com
The tech stack for this project is:
* Grounded SAM to retrieve polygons for books.
* OpenCV + supervision transformations to prepare books for OCR.
* GPT-4 with Vision for OCR.
* Google Books API to get book metadata.
* HTML + SVG generation to …
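The last step in the list can be sketched roughly as below. This is only a minimal illustration of how book polygons might become clickable regions in an SVG overlay, not the author's actual code. The function names (`polygon_to_svg_link`, `build_clickable_overlay`), the shape of the `books` dictionaries, and the example URL are all hypothetical, and the sketch assumes the polygons and metadata links have already been produced by the earlier steps.

```python
from html import escape


def polygon_to_svg_link(points, url, title):
    """Wrap one book polygon in an SVG <a> element so it becomes clickable.

    `points` is a list of (x, y) pixel coordinates (hypothetical format).
    """
    pts = " ".join(f"{x},{y}" for x, y in points)
    return (
        f'<a href="{escape(url, quote=True)}" target="_blank">'
        f'<polygon points="{pts}" fill="transparent" stroke="none">'
        f"<title>{escape(title)}</title></polygon></a>"
    )


def build_clickable_overlay(image_path, width, height, books):
    """Return an SVG string: the shelf photo plus one clickable polygon per book.

    `books` is assumed to be a list of dicts with "polygon", "url" and "title".
    """
    links = "\n".join(
        polygon_to_svg_link(b["polygon"], b["url"], b["title"]) for b in books
    )
    return (
        f'<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 {width} {height}">\n'
        f'<image href="{escape(image_path, quote=True)}" '
        f'width="{width}" height="{height}" />\n'
        f"{links}\n</svg>"
    )
```

The resulting string can be embedded directly in an HTML page. Using a `viewBox` that matches the photo's pixel dimensions keeps the polygon coordinates aligned with the image when the page is resized.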