Aug. 9, 2022, 1:12 a.m. | Roi Ronen, Shahar Tsiper, Oron Anschel, Inbal Lavi, Amir Markovitz, R. Manmatha

cs.CV updates on arXiv.org arxiv.org

In recent years, the dominant paradigm for text spotting is to combine the
tasks of text detection and recognition into a single end-to-end framework.
Under this paradigm, both tasks are accomplished by operating over a shared
global feature map extracted from the input image. Among the main challenges
that end-to-end approaches face is the performance degradation when recognizing
text across scale variations (smaller or larger text), and arbitrary word
rotation angles. In this work, we address these challenges by proposing …

arxiv attention cv global local attention text

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne