Oct. 18, 2023, 3:28 a.m. | /u/ssiddharth408

Deep Learning www.reddit.com

I am developing a invoice data extraction using ocr and named entity recognition but some invoices have text arranged top to bottom and ocr reads line by line which makes it difficult to get Bill to details or ship to details. I searched and found that we can do layout analysis/classification to get region of interest. Can someone share any example or sample how to mark regions for better results as I am confused do i need to mark every …

analysis bill classification data data extraction deeplearning extraction found invoice line ocr recognition text

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Business Data Scientist, gTech Ads

@ Google | Mexico City, CDMX, Mexico

Lead, Data Analytics Operations

@ Zocdoc | Pune, Maharashtra, India