[D] Best hassle-free approach to loading a T5 model on multiple GPUs?

Jan. 27, 2022, 7:50 p.m. | /u/rirhun

Machine Learning www.reddit.com

Does anybody know how to load this large model (https://github.com/google-research/text-to-text-transfer-transformer) across multiple GPUs on the same machine, with some layers on some GPUs and the remaining layers on others? The model needs about 100 GB of memory, and although I'm trying to use model.parallelize(), I still run into a CUDA out-of-memory error.

Any assistance would be greatly appreciated!
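
For reference, here is a rough sketch of the layer-to-GPU split described above. It assumes the Hugging Face transformers port of T5 (the question links the original Google repo, but model.parallelize() is the Hugging Face method), whose parallelize() accepts a dict mapping a GPU index to a list of transformer block indices. The helper name build_device_map and its first_gpu_blocks parameter are made up for illustration. Giving the first GPU fewer blocks is a common choice because the embeddings and output head also live there.

```python
def _even_sizes(total, parts):
    """Split `total` items into `parts` chunk sizes that differ by at most one."""
    base, extra = divmod(total, parts)
    # Later chunks absorb the remainder so the first device stays lightest.
    return [base + (1 if i >= parts - extra else 0) for i in range(parts)]


def build_device_map(num_blocks, num_gpus, first_gpu_blocks=None):
    """Hypothetical helper: map GPU index -> contiguous list of block indices.

    first_gpu_blocks optionally caps how many blocks go on GPU 0; the
    remaining blocks are spread evenly over the other GPUs.
    """
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    if num_blocks < num_gpus:
        raise ValueError("need at least one block per GPU")

    if first_gpu_blocks is None or num_gpus == 1:
        sizes = _even_sizes(num_blocks, num_gpus)
    else:
        if not 0 < first_gpu_blocks <= num_blocks - (num_gpus - 1):
            raise ValueError("first_gpu_blocks leaves too few blocks for the other GPUs")
        sizes = [first_gpu_blocks] + _even_sizes(
            num_blocks - first_gpu_blocks, num_gpus - 1
        )

    device_map, start = {}, 0
    for gpu, size in enumerate(sizes):
        device_map[gpu] = list(range(start, start + size))
        start += size
    return device_map


if __name__ == "__main__":
    # 24 blocks over 4 GPUs, with only 3 blocks on GPU 0.
    print(build_device_map(24, 4, first_gpu_blocks=3))

    # Assumed usage with the Hugging Face port (needs GPUs and the weights,
    # so it is left commented out; the checkpoint name is only an example):
    # from transformers import T5ForConditionalGeneration
    # model = T5ForConditionalGeneration.from_pretrained("t5-3b")
    # model.parallelize(build_device_map(model.config.num_layers, 4, first_gpu_blocks=3))
```

Even with a custom map, the combined memory of all GPUs still has to exceed the model's footprint plus activations, so a roughly 100 GB model will not fit on a few small cards however the blocks are split. Note also that parallelize() has been marked deprecated in later transformers releases; passing device_map="auto" to from_pretrained (which requires the accelerate package) may be the simpler route there.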

