Cost/Performance Optimization with LLMs [Panel]
MLOps.community mlops.community
Sign up for the next LLM in production conference here: https://go.mlops.community/LLMinprod
Watch all the talks from the first conference: https://go.mlops.community/llmconfpart1
// Abstract
This panel discussion explores the cost of running large language models (LLMs) and potential ways to reduce it. The panelists discuss the benefits of bringing LLMs in-house, such as latency optimization and greater control, and examine optimization methods including structured pruning and knowledge distillation. OctoML's platform is mentioned as a tool …