Oct. 19, 2023, 6:13 a.m. | /u/Former_Goose_6894

Deep Learning www.reddit.com

I want to fine-tune models like LLaMA-70B, Code Alpaca, or StarCoder for my use case. I have a dataset of 10,000 Dockerfiles and want to fine-tune a model on it.

What training time should I expect for this kind of run, and which GPU should I use? Also, will the hardware requirements differ between just running the model (inference) and training it?

PS: I am very new to the world of deep learning, LLMs, and GPUs. Any help would …
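Since the question asks both which GPU to use and whether inference and training differ, a rough back-of-envelope memory estimate is the usual starting point. The sketch below uses common rule-of-thumb per-parameter byte costs (fp16 inference ≈ 2 bytes/param; full fine-tuning with Adam in mixed precision ≈ 16 bytes/param; a 4-bit QLoRA base ≈ 0.5 bytes/param). The function name and the exact constants are illustrative assumptions, not measured numbers, and activations/KV cache are ignored:

```python
# Rough VRAM estimator for LLM inference vs. fine-tuning.
# All per-parameter byte costs are back-of-envelope rules of thumb,
# not exact measurements; activations and KV cache are excluded.

def estimate_vram_gb(params_billion: float, mode: str) -> float:
    """Estimate GPU memory in (decimal) GB for a model of the given size.

    Modes and assumed per-parameter costs:
      - "inference": fp16 weights only                      -> ~2  bytes/param
      - "full_ft":   fp16 weights + grads + fp32 Adam
                     moments + fp32 master weights          -> ~16 bytes/param
      - "qlora":     4-bit quantized base weights (LoRA
                     adapter itself is tiny and ignored)    -> ~0.5 bytes/param
    """
    bytes_per_param = {"inference": 2, "full_ft": 16, "qlora": 0.5}
    return params_billion * bytes_per_param[mode]

# A 70B model: ~140 GB just to load in fp16, ~1120 GB for full
# fine-tuning with Adam, but only ~35 GB for the 4-bit base in QLoRA.
for mode in ("inference", "full_ft", "qlora"):
    print(mode, estimate_vram_gb(70, mode), "GB")
```

This is why, in practice, full fine-tuning of a 70B model needs a multi-GPU node, while QLoRA-style parameter-efficient fine-tuning can fit on one or two 48–80 GB cards, and inference always needs substantially less than training.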
