July 2, 2023, 4:48 a.m. | /u/Accretence


Theoretically we'll be serving a large number of users, and I'm worried about tying up entire GPUs: a single Stable Diffusion request takes an Nvidia A100 about 20 seconds to fulfill.

Do I need as many GPUs as my concurrent users?

Is there any relevant guide on GPU virtualization?
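
For illustration, the common alternative to one GPU per concurrent user is to queue requests and let a single worker own each GPU, so users wait in line rather than each claiming a device. Below is a minimal sketch of that pattern, assuming the Hugging Face `diffusers` library; the model name, queue layout, and `submit` helper are illustrative, not anything from the post.

```python
# Minimal sketch: serialize Stable Diffusion requests through one worker
# thread that owns the GPU, instead of one GPU per user.
# Assumes the Hugging Face `diffusers` library; model name is illustrative.
import queue
import threading

import torch
from diffusers import StableDiffusionPipeline

# Load the pipeline once and pin it to one GPU; reuse it for every request.
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# Each queued item is (prompt, reply_queue).
requests: "queue.Queue[tuple[str, queue.Queue]]" = queue.Queue()

def gpu_worker() -> None:
    # Single consumer: the GPU handles requests one at a time (~20 s each),
    # so N waiting users see up to N * 20 s of latency on one GPU.
    while True:
        prompt, reply = requests.get()
        image = pipe(prompt).images[0]
        reply.put(image)

threading.Thread(target=gpu_worker, daemon=True).start()

def submit(prompt: str):
    # Called from request-handling threads; blocks until the image is ready.
    reply: queue.Queue = queue.Queue(maxsize=1)
    requests.put((prompt, reply))
    return reply.get()
```

Under this model, throughput (requests per second) rather than concurrent user count determines how many GPUs are needed: each extra GPU gets its own worker draining the same queue.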
