Feb. 12, 2024, 6:20 a.m. | /u/mrcet007

Machine Learning www.reddit.com

Which are good resources or book on efficiently deploying classical ML in production for very high throughput. Say 100k request per seconds for inference & need low latency.

I am not taking about scaling deploying transfomer or neural networks in production. But classical ML model for classicication/regression using say Lightgbm, Xgboost ,RF, SVM etc. for this scale.

Looking for sources which talk about improving model efficency, and data etl efficency for inference etc.

I couldnt find resource for classical ML …

book books good inference latency low low latency machinelearning networks neural networks per production resources scaling

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

Research Scholar (Technical Research)

@ Centre for the Governance of AI | Hybrid; Oxford, UK

Backend Spark Developer

@ Talan | Warsaw, Poland

Pricing & Data Management Intern

@ Novelis | Atlanta, GA, United States

Sr Data Engineer

@ Visa | Bengaluru, India

Customer Analytics / Data Science - Lead Analyst - Analytics US Timezone

@ dentsu international | Bengaluru, India