Feb. 14, 2024, 10:21 a.m. | /u/zerojames_

Machine Learning www.reddit.com

I built a system that turns a photo of a bookshelf into an interactive HTML web page where you can click on each book in the image to learn more about it.

The tech stack for this project is:

* Grounded SAM to retrieve polygons for books.
* OpenCV + supervision transformations to prepare books for OCR.
* GPT-4 with Vision for OCR.
* Google Books API to get book metadata.
* HTML + SVG generation to …
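As a rough illustration of the last two steps, here is a minimal Python sketch, not the project's actual code. It assumes the earlier stages (Grounded SAM and GPT-4 with Vision) have already produced a polygon and the recognized text for each book. The function names, the dictionary keys (`title`, `url`, `polygon`), and the choice to wrap each polygon in an SVG link are all assumptions made for illustration. The only external detail used is the public Google Books volumes search endpoint; the sketch only builds the query URL and makes no network request.

```python
from html import escape
from urllib.parse import urlencode

# Public Google Books volumes search endpoint.
GOOGLE_BOOKS_ENDPOINT = "https://www.googleapis.com/books/v1/volumes"


def google_books_query_url(ocr_text: str) -> str:
    """Build a search URL for the text read off a book (hypothetical helper)."""
    return f"{GOOGLE_BOOKS_ENDPOINT}?{urlencode({'q': ocr_text, 'maxResults': 1})}"


def polygon_to_points(polygon) -> str:
    """Convert [(x, y), ...] into the SVG 'points' attribute format."""
    return " ".join(f"{x},{y}" for x, y in polygon)


def build_page(image_src: str, width: int, height: int, books: list) -> str:
    """Return an HTML page with the photo and one clickable polygon per book.

    Each entry in `books` is assumed to be a dict with the keys
    'title', 'url', and 'polygon' (hypothetical structure).
    """
    shapes = []
    for book in books:
        points = polygon_to_points(book["polygon"])
        shapes.append(
            f'<a href="{escape(book["url"], quote=True)}">'
            f'<polygon points="{points}" fill="transparent" stroke="red">'
            f'<title>{escape(book["title"])}</title>'
            f"</polygon></a>"
        )
    svg = (
        f'<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 {width} {height}">'
        f'<image href="{escape(image_src, quote=True)}" '
        f'width="{width}" height="{height}"/>'
        + "".join(shapes)
        + "</svg>"
    )
    return f"<!DOCTYPE html>\n<html><body>{svg}</body></html>"
```

In this sketch the polygons use the same pixel coordinates as the photo, so setting the SVG `viewBox` to the image size keeps the clickable regions aligned with the books when the page is resized.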
