March 29, 2024, 4:48 a.m. | Nicolas Gutehrl\'e (CRIT), Iana Atanassova (CRIT, STIH, TESNIERE, LaLIC)

cs.CL updates on arXiv.org arxiv.org

arXiv:2403.19201v1 Announce Type: cross
Abstract: The digitisation campaigns carried out by libraries and archives in recent years have facilitated access to documents in their collections. However, exploring and exploiting these documents remain difficult tasks due to the sheer quantity of documents available for consultation. In this article, we show how the semantic annotation of the textual content of study corpora of archival documents allow to facilitate their exploitation and valorisation. First, we present a methodological framework for the construction of …

abstract annotation archives article arxiv campaigns cs.cl cs.dl digitisation documents however interfaces libraries research semantic tasks type understanding

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Global Data Architect, AVP - State Street Global Advisors

@ State Street | Boston, Massachusetts

Data Engineer

@ NTT DATA | Pune, MH, IN