Oct. 28, 2022, 1:16 a.m. | Jason Hoelscher-Obermaier, Edward Stevinson, Valentin Stauber, Ivaylo Zhelev, Victor Botev, Ronin Wu, Jeremy Minton

cs.CL updates on arXiv.org arxiv.org

The most interesting words in scientific texts will often be novel or rare.
This presents a challenge for scientific word embedding models to determine
quality embedding vectors for useful terms that are infrequent or newly
emerging. We demonstrate how \gls{lsi} can address this problem by imputing
embeddings for domain-specific words from up-to-date knowledge graphs while
otherwise preserving the original word embedding model. We use the MeSH
knowledge graph to impute embedding vectors for biomedical terminology without
retraining and evaluate the …

arxiv graphs imputation knowledge knowledge graphs semantic word embeddings

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Research Scientist

@ Meta | Menlo Park, CA

Principal Data Scientist

@ Mastercard | O'Fallon, Missouri (Main Campus)