Aug. 17, 2022, 3:27 a.m. | /u/osicli

Machine Learning www.reddit.com

PP-Structure is an OCR toolkit that can be used for document analysis and processing with complex structures, designed to help developers better complete document understanding tasks

​

\* Support the layout analysis of documents, divide the documents into 5 types of areas \*\*text, title, table, image and list\*\* (conjunction with Layout-Parser)

\* Support to extract the texts from the text, title, picture and list areas (used in conjunction with PP-OCR)

\* Support to extract excel files from the table areas …

extraction machinelearning table extraction tool

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote