Jan. 16, 2024, 5 p.m. | Pragati Jhunjhunwala

MarkTechPost www.marktechpost.com

In a recent tweet from the founder of Dataquest.io, Vik Paruchuri recently publicized the launch of a multilingual document OCR toolkit, Surya. The framework can efficiently detect line-level bboxes and column breaks in documents, scanned images, or presentations. The existing text detection models like Tesseract work at the word or character level, while this open-source […]


The post Meet Surya: A Multilingual Text Line Detection AI Model for Documents appeared first on MarkTechPost.

ai model ai shorts applications artificial intelligence column detection document documents editors pick founder framework images launch line multilingual ocr presentations staff tech news technology tesseract text toolkit tweet word work

More from www.marktechpost.com / MarkTechPost

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US