HuSpaCy: an industrial-strength Hungarian natural language processing toolkit (arXiv:2201.01956v1 [cs.CL])

Jan. 7, 2022, 2:10 a.m. | György Orosz, Zsolt Szántó, Péter Berkecz, Gergő Szabó, Richárd Farkas

stat.ML updates on arXiv.org

Although there are a couple of open-source language processing pipelines
available for Hungarian, none of them satisfies the requirements of today's NLP
applications. A language processing pipeline should provide close to
state-of-the-art lemmatization, morphosyntactic analysis, entity recognition,
and word embeddings. Industrial text processing applications must also satisfy
non-functional software quality requirements; moreover, frameworks
supporting multiple languages are increasingly favored. This paper introduces
HuSpaCy, an industry-ready Hungarian language processing pipeline. The presented
tool provides components for …
