Oct. 18, 2022, 1:12 a.m. | Juan Zuluaga-Gomez, Seyyed Saeed Sarfjoo, Amrutha Prasad, Iuliia Nigmatulina, Petr Motlicek, Karel Ondrej, Oliver Ohneiser, Hartmut Helmke

cs.CL updates on arXiv.org arxiv.org

Automatic speech recognition (ASR) allows transcribing the communications
between air traffic controllers (ATCOs) and aircraft pilots. The transcriptions
are used later to extract ATC named entities, e.g., aircraft callsigns. One
common challenge is speech activity detection (SAD) and speaker diarization
(SD). In the failure condition, two or more segments remain in the same
recording, jeopardizing the overall performance. We propose a system that
combines SAD and a BERT model to perform speaker change detection and speaker
role detection (SRD) by …

air traffic arxiv bert change communications detection role traffic

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York