Oct. 12, 2023, 4:38 p.m. | Oleg Kokorin

Hacker Noon - ai hackernoon.com

OCR software alone can't handle complex documents — special symbols, rotated text, low-quality scans.
Using deep learning, one can augment ready-made OCR solutions and allow for processing of complex documents. From removing false positives to using binary matrices to detect complex spreadsheets, deep learning can handle any document.
In this article I describe my experience with developing a system for detecting technical drawings of floor plans, the perfect example of applying modern CV and AI to complex document digitization.

Read …

ai binary computer vision deep learning document documents false false positives image recognition low machine learning neural networks ocr ocr software processing quality recognition scans software solutions spreadsheets text work

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Sr. Software Development Manager, AWS Neuron Machine Learning Distributed Training

@ Amazon.com | Cupertino, California, USA