Nov. 7, 2023, 9:16 a.m. | MLOps.community | www.youtube.com

// Abstract
Conversational AI demands low latency for seamless dialogue between humans and AI. Engineers face a dilemma, however: some latency is inherent in processing human speech and crafting a response. Some incremental wins that shave off milliseconds trade away enrichment the AI response could have gained from that extra processing time; others simply refactor out inefficiency to get more performant results from AI devtools. This talk presents best practices for designing streaming speech-to-text …
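The core latency win behind streaming speech-to-text can be illustrated with a toy asyncio pipeline. This is a minimal sketch under assumed timings; record_chunk, transcribe, and generate_reply are hypothetical stand-ins for a real microphone stream, STT engine, and LLM client, not code from the talk.

```python
import asyncio
import time

# Illustrative numbers only: audio arrives in 250 ms chunks, each chunk
# costs 50 ms to transcribe, and the LLM takes 300 ms to first token.
CHUNK_SEC = 0.25
STT_SEC = 0.05
LLM_SEC = 0.30

async def record_chunk(text: str) -> str:
    await asyncio.sleep(CHUNK_SEC)   # "microphone" delivering one audio chunk
    return text

async def transcribe(text: str) -> str:
    await asyncio.sleep(STT_SEC)     # simulated per-chunk STT cost
    return text

async def generate_reply(transcript: str) -> str:
    await asyncio.sleep(LLM_SEC)     # simulated LLM time-to-first-token
    return f"reply to: {transcript!r}"

async def batch(utterance):
    # Wait for all audio, then transcribe, then respond: every cost stacks.
    chunks = [await record_chunk(c) for c in utterance]
    transcript = " ".join([await transcribe(c) for c in chunks])
    return await generate_reply(transcript)

async def streaming(utterance):
    # Transcribe each chunk while the next one is still being recorded,
    # hiding almost all STT work behind the speech itself.
    pending, words = None, []
    for c in utterance:
        chunk = await record_chunk(c)
        if pending:
            words.append(await pending)
        pending = asyncio.create_task(transcribe(chunk))
    words.append(await pending)
    return await generate_reply(" ".join(words))

async def main():
    for label, fn in [("batch", batch), ("streaming", streaming)]:
        t0 = time.perf_counter()
        await fn(["turn", "off", "the", "lights"])
        print(f"{label:9s} {time.perf_counter() - t0:.2f}s")

asyncio.run(main())
```

Under these assumed timings the batch path takes roughly 1.50 s (1.00 s of speech + 0.20 s of STT + 0.30 s of LLM), while the streaming path takes roughly 1.35 s: only the final chunk's transcription and the LLM call remain after the user stops speaking. That residual work is exactly the "inherently required" latency the abstract describes.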

