Jan. 19, 2022, 2:17 p.m. | /u/polandtown

Natural Language Processing www.reddit.com

Hello fellow enthusiasts,

I have a corpus of 150k documents, and their respective OCR outputs.

I'd like to assign a Readability score to each document, is there a metric out there for something like that?

In retrospect to my OCR extraction, which took almost a month of runtime to run, I could have extracted an OCR-accuracy score along with my strings. I'd like to find an alternative solution instead of re-running it. Knowledge for next time, anyways...

I'm open to …

languagetechnology string

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

IT Commercial Data Analyst - ESO

@ National Grid | Warwick, GB, CV34 6DA

Stagiaire Data Analyst – Banque Privée - Juillet 2024

@ Rothschild & Co | Paris (Messine-29)

Operations Research Scientist I - Network Optimization Focus

@ CSX | Jacksonville, FL, United States

Machine Learning Operations Engineer

@ Intellectsoft | Baku, Baku, Azerbaijan - Remote

Data Analyst

@ Health Care Service Corporation | Richardson Texas HQ (1001 E. Lookout Drive)