Jan. 19, 2024, 3:27 p.m. | /u/TheRealBracketMaster

Machine Learning (www.reddit.com)

I have a stockpile of 22 servers, each with 8 AMD MI50 GPUs (see notes about the MI50s below). I've been able to get PyTorch working on these GPUs and have run inference for several different large language models. I originally wanted to use these GPUs to serve LLMs, but vLLM's CUDA kernels don't work out of the box on the MI50, and llama.cpp has a bug where it only supports up to 4 AMD GPUs at once.

So …
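For context on the "PyTorch working on these GPUs" part: ROCm builds of PyTorch expose AMD GPUs through the usual torch.cuda API, so ordinary Hugging Face inference code runs unmodified. Below is a minimal sketch of that kind of setup check and inference; the model name, dtype, and generation settings are illustrative assumptions, not the poster's actual configuration.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# On ROCm builds of PyTorch, AMD GPUs are surfaced via the CUDA API,
# so torch.cuda.is_available() is True on MI50s as well.
assert torch.cuda.is_available(), "no ROCm/CUDA device visible"
print(f"visible GPUs: {torch.cuda.device_count()}")

# Placeholder model, small enough to fit on a single MI50 (16 GB).
model_name = "facebook/opt-1.3b"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name, torch_dtype=torch.float16
).to("cuda")

# Tokenize a prompt, move it to the GPU, and generate a short completion.
inputs = tokenizer("Hello, world", return_tensors="pt").to("cuda")
with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```

This only exercises single-GPU inference; the multi-GPU serving path is exactly where the poster hits the vLLM kernel and llama.cpp 4-GPU limitations.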
