all AI news
QLoRA is all you need (Fast and lightweight model fine-tuning)
Sept. 15, 2023, 3:20 p.m. | sentdex
sentdex www.youtube.com
I referenced a LOT of stuff in this video, I will do my best to link everything, but let me know if I forget anything.
Resources:
WSB-GPT-7B Model: https://huggingface.co/Sentdex/WSB-GPT-7B
WSB-GPT-13B Model: https://huggingface.co/Sentdex/WSB-GPT-13B
WSB Training data: https://huggingface.co/datasets/Sentdex/wsb_reddit_v002
Code:
QLoRA Repo: https://github.com/artidoro/qlora
qlora.py: https://github.com/artidoro/qlora/blob/main/qlora.py
Simple qlora training notebook: https://colab.research.google.com/drive/1VoYNfYDKcKRQRor98Zbf2-9VQTtGJ24k?usp=sharing
qlora merging/dequantizing code: https://gist.github.com/ChrisHayduk/1a53463331f52dca205e55982baf9930
Referenced …
case dataset everything fine-tuning low model fine-tuning process qlora reddit video
More from www.youtube.com / sentdex
Building an LLM fine-tuning Dataset
1 month, 4 weeks ago |
www.youtube.com
Visualizing Neural Network Internals
2 months, 3 weeks ago |
www.youtube.com
Getting Back on Grid
2 months, 4 weeks ago |
www.youtube.com
Open Source AI Inference API w/ Together
4 months, 1 week ago |
www.youtube.com
INFINITE Inference Power for AI
4 months, 2 weeks ago |
www.youtube.com
Pandas Dataframes on your GPU w/ CuDF
5 months, 3 weeks ago |
www.youtube.com
QLoRA is all you need (Fast and lightweight model fine-tuning)
7 months, 2 weeks ago |
www.youtube.com
Chat Interface for your Local Llama LLMs
8 months, 1 week ago |
www.youtube.com
Gzip is all You Need! (This SHOULD NOT work)
9 months, 1 week ago |
www.youtube.com
Jobs in AI, ML, Big Data
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne