Aug. 30, 2022, 1:13 a.m. | Aly Mostafa, Omar Mohamed, Ali Ashraf, Ahmed Elbehery, Salma Jamal, Anas Salah, Amr S. Ghoneim

cs.CL updates on arXiv.org arxiv.org

This research is the second phase in a series of investigations into developing
an Optical Character Recognition (OCR) system for Arabic historical documents and
examining how different modeling procedures interact with the problem. The
first study examined the effect of Transformers on our custom-built Arabic
dataset. One of its downsides was the size of the training data, a mere
15,000 of our 30 million images, due to a lack of resources. Also, we add an
image enhancement …
