Sept. 15, 2023, 3:20 p.m. | sentdex

sentdex www.youtube.com

Learning and sharing my process for QLoRA (quantized low-rank adapter) fine-tuning. In this case, I use a custom-made Reddit dataset, but you can use anything you want.

I referenced a LOT of stuff in this video. I'll do my best to link everything, but let me know if I forget anything.

Resources:
WSB-GPT-7B Model: https://huggingface.co/Sentdex/WSB-GPT-7B
WSB-GPT-13B Model: https://huggingface.co/Sentdex/WSB-GPT-13B
WSB Training data: https://huggingface.co/datasets/Sentdex/wsb_reddit_v002

Code:
QLoRA Repo: https://github.com/artidoro/qlora
qlora.py: https://github.com/artidoro/qlora/blob/main/qlora.py
Simple qlora training notebook: https://colab.research.google.com/drive/1VoYNfYDKcKRQRor98Zbf2-9VQTtGJ24k?usp=sharing
qlora merging/dequantizing code: https://gist.github.com/ChrisHayduk/1a53463331f52dca205e55982baf9930
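For orientation, the setup the linked qlora.py and notebook build on can be sketched with the Hugging Face stack (transformers + peft + bitsandbytes): load the base model in 4-bit NF4, then attach low-rank adapters. This is a minimal configuration sketch, not the video's exact script; the base model name, rank, and target modules are illustrative assumptions, and running it requires a GPU and the model weights.

```python
# Minimal QLoRA setup sketch (illustrative hyperparameters, not the
# exact settings used in the video).
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# 4-bit NF4 quantization with double quantization, as in the QLoRA paper.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

# Base model is an assumption here; the video fine-tunes 7B/13B models.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# Low-rank adapters on the attention projections; only these small
# adapter weights are trained, the quantized base stays frozen.
lora_config = LoraConfig(
    r=64,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # shows how few parameters train
```

From here you would train with a standard `transformers` `Trainer` (or the loop in qlora.py), then merge the adapters back into a full-precision model with the merging/dequantizing gist linked above.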

Referenced …

