April 9, 2024, 1:56 p.m. | Maxim Saplin

DEV Community dev.to

I have been experimenting with supervised fine-tuning (SFT) and LoRA on my laptop's NVIDIA RTX 4060 8GB. The subject of SFT is vast, picking the right training hyperparameters is more magic than science, and there's a good deal of experimentation...
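Part of why LoRA is attractive on an 8GB card is how few parameters it actually trains. The back-of-the-envelope sketch below uses illustrative dimensions (not the actual Stable LM 2 1.6B config) to compare LoRA adapter parameters against the frozen attention weights they wrap:

```python
# Rough arithmetic: trainable parameters for LoRA adapters vs. the frozen
# weights they wrap. All dimensions below are assumptions for illustration.

def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """A rank-r LoRA adapter replaces a full d_in x d_out weight update
    with two low-rank matrices: A (d_in x rank) and B (rank x d_out)."""
    return d_in * rank + rank * d_out

hidden = 2048    # assumed hidden size
n_layers = 24    # assumed layer count
rank = 16        # a commonly used LoRA rank

# Adapters on the four attention projections (q, k, v, o) of every layer.
trainable = n_layers * 4 * lora_params(hidden, hidden, rank)
full = n_layers * 4 * hidden * hidden  # the frozen weights being adapted

print(f"LoRA trainable params:   {trainable:,}")    # 6,291,456
print(f"Frozen attention params: {full:,}")         # 402,653,184
print(f"Fraction trained:        {trainable / full:.2%}")  # 1.56%
```

With these (assumed) dimensions, LoRA trains under 2% of the attention weights, which is what makes gradients and optimizer states fit alongside the model on a small GPU.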


Still, let me share one small finding: the effect of GPU utilization and shared memory on training speed.
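For context on why shared memory comes into play at all: once allocations exceed the card's 8 GB of dedicated VRAM, the Windows driver can spill into shared system memory over PCIe, which is far slower. A rough estimate (my assumptions, not figures from the post) of what full fine-tuning of a 1.6B-parameter model with Adam would need shows why an 8 GB card is pushed toward LoRA or spilling:

```python
# Back-of-the-envelope memory footprint for FULL fine-tuning of a
# 1.6B-parameter model with Adam in mixed precision. These byte counts
# are standard rules of thumb, excluding activations and framework overhead.

GB = 1024 ** 3
params = 1.6e9

weights_fp16 = params * 2          # fp16 weights: 2 bytes/param
grads_fp16 = params * 2            # fp16 gradients: 2 bytes/param
adam_states_fp32 = params * 4 * 2  # fp32 first + second moments: 8 bytes/param
master_fp32 = params * 4           # fp32 master weights: 4 bytes/param

total = weights_fp16 + grads_fp16 + adam_states_fp32 + master_fp32
print(f"~{total / GB:.1f} GB")  # ~23.8 GB, roughly 3x an 8 GB card
```

LoRA sidesteps most of the gradient and optimizer cost, but even then the weights plus activations can hover near the 8 GB limit, so any spill into shared memory shows up directly as slower steps.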


I used the Stable LM 2 1.6B base model and turned it into a chat model using 4,400 samples from the OASTT2 dataset. Here is …
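Turning a base model into a chat model via SFT means rendering each conversation sample into the chat template the model will be served with. As a hypothetical sketch (the template choice here is a ChatML-style assumption, not necessarily what the original post used), the formatting step might look like:

```python
# Hypothetical sketch: formatting OASTT2-style conversation samples into
# ChatML-style training text for SFT. The template is an assumption.

def to_chatml(messages: list) -> str:
    """messages: list of {"role": "user"|"assistant", "content": "..."}."""
    parts = [f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>"
             for m in messages]
    return "\n".join(parts) + "\n"

sample = [
    {"role": "user", "content": "What is LoRA?"},
    {"role": "assistant", "content": "A parameter-efficient fine-tuning method."},
]
print(to_chatml(sample))
```

In practice one would apply a function like this across all 4,400 samples before tokenization, so the model learns the role markers along with the responses.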

