Nov. 23, 2022, 2:16 a.m. | Petr Zelina, Jana Halámková, Vít Nováček

cs.CL updates on arXiv.org arxiv.org

This work is motivated by the scarcity of tools for accurate, unsupervised
information extraction from unstructured clinical notes in computationally
underrepresented languages, such as Czech. We introduce a stepping stone to a
broad array of downstream tasks such as summarisation or integration of
individual patient records, extraction of structured information for national
cancer registry reporting or building of semi-structured semantic patient
representations for computing patient embeddings. More specifically, we present
a method for unsupervised extraction of semantically-labelled textual segments
from …

arxiv clustering extraction labelling notes unsupervised

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Sr. Software Development Manager, AWS Neuron Machine Learning Distributed Training

@ Amazon.com | Cupertino, California, USA