May 8, 2023, 12:43 p.m. | MLOps.community

// Abstract
In this panel discussion, the panelists examine the cost of running large language models (LLMs) in production and potential ways to reduce it. They discuss the benefits of bringing LLMs in-house, such as latency optimization and greater control, and explore optimization methods such as structured pruning and knowledge distillation. OctoML's platform is mentioned as a tool for automatically deploying custom models and selecting the most appropriate hardware for them. Overall, the discussion provides …
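To make the two optimization methods named above concrete, here is a minimal NumPy sketch (not from the panel; the function names, temperature value, and keep fraction are illustrative assumptions): knowledge distillation trains a smaller student against a teacher's temperature-softened output distribution, and structured pruning removes whole rows (neurons) of a weight matrix rather than individual weights.

```python
import numpy as np

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over the last axis."""
    z = logits / temperature
    z = z - z.max(axis=-1, keepdims=True)  # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """Knowledge-distillation loss sketch: KL(teacher || student) on
    temperature-softened distributions, scaled by T^2 (the usual convention
    so gradients keep a comparable magnitude across temperatures)."""
    p = softmax(teacher_logits, temperature)  # soft targets from the teacher
    q = softmax(student_logits, temperature)  # student's softened predictions
    kl = np.sum(p * (np.log(p) - np.log(q)), axis=-1)
    return float(np.mean(kl) * temperature ** 2)

def prune_rows(weight, keep_fraction=0.5):
    """Structured-pruning sketch: keep only the output rows (neurons)
    with the largest L2 norms, so the layer actually shrinks."""
    norms = np.linalg.norm(weight, axis=1)
    k = max(1, int(round(len(norms) * keep_fraction)))
    keep = np.sort(np.argsort(norms)[-k:])  # indices of surviving rows, in order
    return weight[keep]
```

When student and teacher logits match, the distillation loss is zero; pruning a 4x3 weight matrix at `keep_fraction=0.5` yields a 2x3 matrix. In practice both techniques are applied inside a training framework, but the arithmetic is exactly this.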
