July 19, 2022, 12:06 p.m. | /u/Boglbert

Data Science www.reddit.com

Hi,

I am curious about your experiences with extracting table data using OCR and Python (image to structured data).

I have tried out several things so far and had the best (not perfect, but OK) results with Detectron & LayoutParser.

Do you suggest additional training of LayoutParser to optimise the output? Have you used other (free or maybe cheap) services to get the best results (DataFrame- or CSV-style output)?

Everything appreciated. Cheers.

