Feb. 27, 2024, 5:50 a.m. | Xunjian Yin, Xinyu Hu, Jin Jiang, Xiaojun Wan

cs.CL updates on arXiv.org arxiv.org

arXiv:2211.07843v2 Announce Type: replace
Abstract: Chinese Spelling Check (CSC) aims to detect and correct error tokens in Chinese contexts, which has a wide range of applications. However, it is confronted with the challenges of insufficient annotated data and the issue that previous methods may actually not fully leverage the existing datasets. In this paper, we introduce our plug-and-play retrieval method with error-robust information for Chinese Spelling Check (RERIC), which can be directly applied to existing CSC models. The datastore for …

abstract annotated data applications arxiv challenges check chinese cs.cl data datasets error issue paper retrieval robust tokens type

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Business Data Analyst

@ Alstom | Johannesburg, GT, ZA