Feb. 20, 2024, 5:44 a.m. | Kan Xu, Xuanyi Zhao, Hamsa Bastani, Osbert Bastani

cs.LG updates on arXiv.org arxiv.org

arXiv:2104.08928v3 Announce Type: replace-cross
Abstract: Unstructured text provides decision-makers with a rich data source in many domains, ranging from product reviews in retail to nursing notes in healthcare. To leverage this information, words are typically translated into word embeddings -- vectors that encode the semantic relationships between words -- through unsupervised learning algorithms such as matrix factorization. However, learning word embeddings from new domains with limited training data can be challenging, because the meaning/usage may be different in the new …

abstract arxiv cs.cl cs.lg data decision domains embeddings encode factorization healthcare information makers matrix notes nursing product product reviews relationships retail reviews rich data semantic stat.ml text through transfer transfer learning translated type unstructured unsupervised vectors word word embeddings words

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Global Data Architect, AVP - State Street Global Advisors

@ State Street | Boston, Massachusetts

Data Engineer

@ NTT DATA | Pune, MH, IN