all AI news
Speed and Sensibility: Balancing Latency and UX in Generative AI // Julia Kroll // LLMs III LT
Nov. 7, 2023, 9:16 a.m. | MLOps.community
MLOps.community www.youtube.com
Conversational AI demands low latency for a seamless dialogue between humans and AI. However, engineers face the dilemma that some latency is inherently required in order to process human speech and craft a response. Some incremental wins to shave off milliseconds involve trade-offs against how the AI response could be enriched during the additional processing time. Others simply refactor out inefficiency to obtain more performant results from AI devtools. This talk presents best practices of designing streaming speech-to-text …
abstract conversational conversational ai dialogue engineers face generative human humans iii incremental julia latency llms low process speech speed
More from www.youtube.com / MLOps.community
Leading Enterprise Data Teams // Sol Rashidi // MLOps Podcast #227
4 days, 16 hours ago |
www.youtube.com
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
MLOps Engineer - Hybrid Intelligence
@ Capgemini | Madrid, M, ES
Analista de Business Intelligence (Industry Insights)
@ NielsenIQ | Cotia, Brazil