Feb. 6, 2024, 5:42 a.m. | Chirag Chhablani Nikhita Sharma Jordan Hosier Vijay K. Gurbani

cs.LG updates on arXiv.org arxiv.org

Automatic Speech Recognition (ASR) systems are used in the financial domain to enhance the caller experience by enabling natural language understanding and facilitating efficient and intuitive interactions. Increasing use of ASR systems requires that such systems exhibit very low error rates. The predominant ASR models to collect numeric data are large, general-purpose commercial models -- Google Speech-to-text (STT), or Amazon Transcribe -- or open source (OpenAI's Whisper). Such ASR models are trained on hundreds of thousands of hours of audio …

asr automatic speech recognition commercial cs.cl cs.lg cs.sd data digits domain enabling error experience financial general interactions language language understanding low micro natural natural language recognition speech speech recognition systems transactions understanding

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Data Scientist (Database Development)

@ Nasdaq | Bengaluru-Affluence