May 7, 2024, 4:50 a.m. | Maxim Ifergan, Renana Keydar, Omri Abend, Amit Pinchevski

cs.CL updates on arXiv.org arxiv.org

arXiv:2405.02650v1 Announce Type: new
Abstract: The vast collection of Holocaust survivor testimonies presents invaluable historical insights but poses challenges for manual analysis. This paper leverages advanced Natural Language Processing (NLP) techniques to explore the USC Shoah Foundation Holocaust testimony corpus. By treating testimonies as structured question-and-answer sections, we apply topic modeling to identify key themes. We experiment with BERTopic, which leverages recent advances in language modeling technology. We align testimony sections into fixed parts, revealing the evolution of topics across …

abstract advanced analysis apply arxiv challenges collection cs.ai cs.cl explore foundation holocaust insights language language processing modeling narrative natural natural language natural language processing nlp outliers paper patterns processing question topic modeling type usc vast

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US