Jan. 16, 2024, 5 p.m. | Pragati Jhunjhunwala

MarkTechPost www.marktechpost.com

Vik Paruchuri, the founder of Dataquest.io, recently announced in a tweet the launch of Surya, a multilingual document OCR toolkit. The framework can efficiently detect line-level bounding boxes and column breaks in documents, scanned images, and presentations. Existing text detection models like Tesseract work at the word or character level, while this open-source […]


The post Meet Surya: A Multilingual Text Line Detection AI Model for Documents appeared first on MarkTechPost.

