Sept. 15, 2023, 3:20 p.m. | sentdex


Learning and sharing my process with QLoRA (quantized low-rank adapters) fine-tuning. In this case, I use a custom-made Reddit dataset, but you can use anything you want.

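The training code isn't inlined in this description, so here is a minimal, hedged sketch of the general QLoRA recipe (4-bit quantized base model plus trainable low-rank adapters) using the Hugging Face transformers/peft/bitsandbytes stack. It is not the exact script used in the video (that is qlora.py, linked below); the base model name and the dataset's text column are assumptions you would swap for your own.

```python
# Minimal QLoRA fine-tuning sketch (assumptions: base model name and the
# dataset's "text" column; adjust both for your setup).
import torch
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

base_model = "meta-llama/Llama-2-7b-hf"           # assumption: any causal LM works
dataset = load_dataset("Sentdex/wsb_reddit_v002")  # the dataset linked below
text_field = "text"                                # assumption: check the dataset's actual column

tokenizer = AutoTokenizer.from_pretrained(base_model)
tokenizer.pad_token = tokenizer.eos_token

# 4-bit NF4 quantization of the frozen base weights (the "Q" in QLoRA)
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    base_model, quantization_config=bnb_config, device_map="auto"
)
model = prepare_model_for_kbit_training(model)

# Small trainable low-rank adapter matrices on the attention projections (the LoRA part)
lora_config = LoraConfig(
    r=64, lora_alpha=16, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

def tokenize(batch):
    return tokenizer(batch[text_field], truncation=True, max_length=512)

train_data = dataset["train"].map(
    tokenize, batched=True, remove_columns=dataset["train"].column_names
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="wsb-qlora",
        per_device_train_batch_size=1,
        gradient_accumulation_steps=16,
        num_train_epochs=1,
        learning_rate=2e-4,
        bf16=True,
        logging_steps=10,
    ),
    train_dataset=train_data,
    # mlm=False gives standard causal-LM labels (inputs shifted by one)
    data_collator=DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm=False),
)
trainer.train()
```

The actual run in the video goes through qlora.py's command-line interface rather than a hand-rolled Trainer, but the moving parts are the same: an NF4 quantization config, prepare_model_for_kbit_training, and a LoraConfig attached with get_peft_model.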
I referenced a LOT of stuff in this video. I will do my best to link everything, but let me know if I forget anything.

Resources:
WSB-GPT-7B Model: https://huggingface.co/Sentdex/WSB-GPT-7B
WSB-GPT-13B Model: https://huggingface.co/Sentdex/WSB-GPT-13B
WSB Training data: https://huggingface.co/datasets/Sentdex/wsb_reddit_v002

Code:
QLoRA Repo: https://github.com/artidoro/qlora
qlora.py: https://github.com/artidoro/qlora/blob/main/qlora.py
Simple qlora training notebook: https://colab.research.google.com/drive/1VoYNfYDKcKRQRor98Zbf2-9VQTtGJ24k?usp=sharing
qlora merging/dequantizing code: https://gist.github.com/ChrisHayduk/1a53463331f52dca205e55982baf9930
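
For context on that last link: after training, the LoRA adapter is a small set of extra weights that can be merged back into the base model. The gist above does this carefully by dequantizing the 4-bit weights first; a simpler, hedged sketch using peft's merge_and_unload on a half-precision copy of the base model looks like this (model name and paths are placeholders):

```python
# Minimal adapter-merging sketch (assumptions: base model name and adapter
# directory are placeholders; the linked gist is a more careful dequantize-then-merge).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_model = "meta-llama/Llama-2-7b-hf"  # assumption: same base used for training
adapter_dir = "wsb-qlora"                # wherever the trainer saved the adapter

# Load the base model in half precision (not 4-bit) so the merge is done in full weights
model = AutoModelForCausalLM.from_pretrained(base_model, torch_dtype=torch.bfloat16)
model = PeftModel.from_pretrained(model, adapter_dir)
merged = model.merge_and_unload()        # folds the (alpha/r) * B @ A update into the base weights

merged.save_pretrained("wsb-gpt-merged")
AutoTokenizer.from_pretrained(base_model).save_pretrained("wsb-gpt-merged")
```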

Referenced …
