Gradually increasing CPU load on using sentence embeddings model with kmeans | allainews.com

Feb. 20, 2024, 8:34 a.m. | /u/Devinco001

Deep Learning www.reddit.com

I am having a ML based production application, using flask, deployed on GCP server using gunicorn workers. In each incoming request, a text sentence is received.

It is using sentence transformers (**All-MiniLM-L6-v2** model), which is loaded globally one time, to create embeddings of the incoming text and then use pre trained kmeans (also loaded globally) to predict/map it to a intent cluster. Basically, goal is to find intent of the sentence.

I have ample resources and the requests are also …

application cpu deeplearning embeddings flask gcp kmeans production server text transformers workers

More from www.reddit.com / Deep Learning

Kolmogorov-Arnold Networks (KANs): A Promising Alternative for Better Accuracy and Interpretability in Deep Learning 14 hours ago | www.reddit.com

accuracy alternative deep learning deeplearning +2

What's your opinions about KAN? 18 hours ago | www.reddit.com

deeplearning opinions

I have a hard time understanding the maths behind AI but I also want to … 1 day, 9 hours ago | www.reddit.com

deeplearning maths suggestions understanding

if i was a freelancer deep learning engineer and i work with a company will … 1 day, 22 hours ago | www.reddit.com

computer deep learning deeplearning engineer +6

What does Speaker Embeddings consists of? 2 days, 1 hour ago | www.reddit.com

architecture deeplearning embeddings lstm +2

Physics-Based Deep Learning: Insights into Physics-Informed Neural Networks (PINNs) 2 days, 19 hours ago | www.reddit.com

deep learning deeplearning insights networks +3

How would one write the following loss function in python? I am currently stuck on … 3 days, 8 hours ago | www.reddit.com

deeplearning function loss python

Tensorflow vs pytorch 3 days, 12 hours ago | www.reddit.com

deep learning deeplearning hey library +5

What is best practice of augmentation on Imbalance dataset? 4 days, 5 hours ago | www.reddit.com

apply articles augmentation case +12

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Senior Machine Learning Engineer

@ Samsara | Canada - Remote

View on ai-jobs.net