April 22, 2024, 4:42 a.m. | Yang Hong, Yinfei Li, Xiaojun Qiao, Rui Li, Junsong Zhang

cs.LG updates on arXiv.org arxiv.org

arXiv:2404.12693v1 Announce Type: cross
Abstract: Learning effective representations for Chinese characters presents unique challenges, primarily due to the vast number of characters and their continuous growth, which requires models to handle an expanding category space. Additionally, the inherent sparsity of character usage complicates the generalization of learned representations. Prior research has explored radical-based sequences to overcome these issues, achieving progress in recognizing unseen characters. However, these approaches fail to fully exploit the inherent tree structure of such sequences. To address …

abstract arxiv challenges characters chinese continuous cs.cv cs.lg growth improving prior representation research space sparsity tree type unique usage vast

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne