all AI news
Topic Detection and Tracking with Time-Aware Document Embeddings
March 27, 2024, 4:48 a.m. | Hang Jiang, Doug Beeferman, Weiquan Mao, Deb Roy
cs.CL updates on arXiv.org arxiv.org
Abstract: The time at which a message is communicated is a vital piece of metadata in many real-world natural language processing tasks such as Topic Detection and Tracking (TDT). TDT systems aim to cluster a corpus of news articles by event, and in that context, stories that describe the same event are likely to have been written at around the same time. Prior work on time modeling for TDT takes this into account, but does not …
abstract aim articles arxiv cluster context cs.cl detection document embeddings event language language processing metadata natural natural language natural language processing processing stories systems tasks tracking type vital world
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Robotics Technician - 3rd Shift
@ GXO Logistics | Perris, CA, US, 92571