An End-to-End OCR Framework for Robust Arabic-Handwriting Recognition using a Novel Transformers-based Model and an Innovative 270 Million-Words Multi-Font Corpus of Classical Arabic with Diacritics. (arXiv:2208.11484v1 [cs.CV]) | allainews.com

Aug. 25, 2022, 1:19 a.m. | Aly Mostafa, Omar Mohamed, Ali Ashraf, Ahmed Elbehery, Salma Jamal, Anas Salah, Amr S. Ghoneim

cs.CV updates on arXiv.org arxiv.org

This research is the second phase in a series of investigations on developing
an Optical Character Recognition (OCR) of Arabic historical documents and
examining how different modeling procedures interact with the problem. The
first research studied the effect of Transformers on our custom-built Arabic
dataset. One of the downsides of the first research was the size of the
training data, a mere 15000 images from our 30 million images, due to lack of
resources. Also, we add an image enhancement …

arxiv cv framework handwriting ocr transformers words

More from arxiv.org / cs.CV updates on arXiv.org

AV-RIR: Audio-Visual Room Impulse Response Estimation 7 hours ago | arxiv.org

arxiv audio cs.cv cs.sd +3

A Hierarchical Architecture for Neural Materials 7 hours ago | arxiv.org

abstract architecture arxiv cs.cv +8

SegFormer3D: an Efficient Transformer for 3D Medical Image Segmentation 7 hours ago | arxiv.org

arxiv cs.cv image medical +3

NU-Class Net: A Novel Deep Learning-based Approach for Video Quality Enhancement 7 hours ago | arxiv.org

abstract arxiv class compression +18

Mosaic-SDF for 3D Generative Models 7 hours ago | arxiv.org

2d image abstract arxiv cs.cv +14

PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection 7 hours ago | arxiv.org

3d object 3d object detection arxiv cs.cv +6

A Multilevel Guidance-Exploration Network and Behavior-Scene Matching Method for Human Behavior Anomaly Detection 7 hours ago | arxiv.org

anomaly anomaly detection arxiv behavior +7

ChatPose: Chatting about 3D Human Pose 7 hours ago | arxiv.org

abstract arxiv cs.cv framework +14

Boosting Audio-visual Zero-shot Learning with Large Language Models 7 hours ago | arxiv.org

arxiv audio boosting cs.cv +7

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Data Management Assistant

@ World Vision | Amman Office, Jordan

View on ai-jobs.net

Cloud Data Engineer, Global Services Delivery, Google Cloud

@ Google | Buenos Aires, Argentina

View on ai-jobs.net