Feb. 27, 2024, 5:47 a.m. | Mingkun Yang, Biao Yang, Minghui Liao, Yingying Zhu, Xiang Bai

cs.CV updates on arXiv.org arxiv.org

arXiv:2402.15806v1 Announce Type: new
Abstract: Scene text recognition (STR) is a challenging task that requires large-scale annotated data for training. However, collecting and labeling real text images is expensive and time-consuming, which limits the availability of real data. Therefore, most existing STR methods resort to synthetic data, which may introduce domain discrepancy and degrade the performance of STR models. To alleviate this problem, recent semi-supervised STR methods exploit unlabeled real data by enforcing character-level consistency regularization between weakly and strongly …

abstract annotated data arxiv availability cs.cv data domain images labeling real data recognition scale semantic semi-supervised synthetic synthetic data text training type visual

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US