April 9, 2024, 1:56 p.m. | Maxim Saplin

DEV Community dev.to

I have been playing with supervised fine-tuning (SFT) and LoRA on my laptop with an NVIDIA RTX 4060 8GB. The subject of SFT is vast, picking the right training hyperparameters is more magic than science, and there's a good deal of experimentation involved...


Still, let me share one small finding: the effect of GPU utilization and shared memory on training speed.


I used the Stable LM 2 1.6B base model and turned it into a chat model using 4,400 samples from the OASTT2 dataset. Here is …
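The core idea behind LoRA, which the post relies on, can be sketched in a few lines. This is not the author's training code — just a minimal NumPy illustration, with hypothetical dimensions, of how a low-rank update B·A replaces a full update to a frozen weight matrix W:

```python
import numpy as np

# LoRA sketch: instead of updating the full weight W (d x k), train a
# low-rank pair B (d x r) and A (r x k) with rank r << min(d, k).
def lora_forward(x, W, A, B, alpha=16):
    """Forward pass with a LoRA update: y = x @ (W + scale * B @ A).T"""
    r = A.shape[0]
    scale = alpha / r  # common scaling convention: alpha / rank
    return x @ (W + scale * (B @ A)).T

d, k, r = 64, 64, 8               # hypothetical dimensions; r is the LoRA rank
rng = np.random.default_rng(0)
W = rng.normal(size=(d, k))        # frozen pretrained weight
A = rng.normal(size=(r, k)) * 0.01
B = np.zeros((d, r))               # B starts at zero, so the adapter is initially a no-op

x = rng.normal(size=(2, k))
y_base = x @ W.T
y_lora = lora_forward(x, W, A, B)
print(np.allclose(y_base, y_lora))  # True: zero-initialized B leaves outputs unchanged

# Trainable parameters shrink from d*k to r*(d+k)
print(d * k, r * (d + k))  # 4096 1024
```

Only A and B would receive gradients during fine-tuning, which is what makes LoRA feasible on an 8GB consumer GPU.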

