April 22, 2024, 4:45 a.m. | Da Chang, Yu Li

cs.CV updates on arXiv.org arxiv.org

arXiv:2404.12734v1 Announce Type: new
Abstract: With the continuous development of OCR technology and the expansion of application fields, text recognition in complex scenes has become a key challenge. Factors such as multiple fonts, mixed scenes and complex layouts seriously affect the recognition accuracy of traditional OCR models. Although OCR models based on deep learning have performed well in specific fields or similar data sets in recent years, the generalization ability and robustness of the model are still a big challenge …

abstract accuracy application arxiv become challenge character recognition continuous continuous development cs.cv development expansion fields key mixed multiple ocr optical optical character recognition recognition technology text transformer type

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Consultant Senior Power BI & Azure - CDI - H/F

@ Talan | Lyon, France