May 23, 2022, 1:12 a.m. | Ori Ernst, Avi Caciularu, Ori Shapira, Ramakanth Pasunuru, Mohit Bansal, Jacob Goldberger, Ido Dagan

cs.CL updates on arXiv.org arxiv.org

Text clustering methods were traditionally incorporated into multi-document
summarization (MDS) as a means for coping with considerable information
repetition. Particularly, clusters were leveraged to indicate information
saliency as well as to avoid redundancy. Such prior methods focused on
clustering sentences, even though closely related sentences usually contain
also non-aligned parts. In this work, we revisit the clustering approach,
grouping together sub-sentential propositions, aiming at more precise
information alignment. Specifically, our method detects salient propositions,
clusters them into paraphrastic clusters, and …

arxiv clustering summarization

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Social Insights & Data Analyst (Freelance)

@ Media.Monks | Jakarta

Cloud Data Engineer

@ Arkatechture | Portland, ME, USA