all AI news
New Tutorial on LLM Quantization w/ QLoRA, GPTQ and Llamacpp, LLama 2
Sept. 9, 2023, noon | code_your_own_AI
code_your_own_AI www.youtube.com
llama.cpp - ggml.c - GGUL - C++
Compare to HF transformers in 4-bit quantization.
Download Web UI wrappers for your heavily quantized LLM to your local machine (PC, Linux, Apple).
LLM on Apple Hardware, w/ M1, M2 or M3 chip.
Run inference of your LLMs on your local PC, with heavy quantization applied.
Plus: 8 Web UI for GTPQ, llama.cpp or AutoGPTQ, exLLama or GGUF.c
koboldcpp
oobabooga text-generation-webui
ctransformers
https://lmstudio.ai/
https://github.com/marella/ctransformers
https://github.com/ggerganov/ggml
https://github.com/rustformers/llm/blob/main/crates/ggml/README.md
https://huggingface.co/TheBloke/Llama-2-13B-chat-GGML/blob/main/README.md
https://github.com/PanQiWei/AutoGPTQ …
apple chip cpp download hardware inference linux llama llama 2 llm machine quantization transformers tutorial web
More from www.youtube.com / code_your_own_AI
Multi-Token Prediction (forget next token LLM?)
1 day, 2 hours ago |
www.youtube.com
NEW LLM Test: Reasoning & gpt2-chatbot
2 days, 7 hours ago |
www.youtube.com
LLMs: Rewriting Our Tomorrow (plus code) #ai
3 days, 14 hours ago |
www.youtube.com
Autonomous AI Agents: 14 % MAX Performance
5 days, 2 hours ago |
www.youtube.com
No more Fine-Tuning: Unsupervised ICL+
1 week, 1 day ago |
www.youtube.com
NEW Phi-3 mini 3.8B LLM for Your PHONE: 1st TEST
1 week, 2 days ago |
www.youtube.com
BEST LLMs for Coding, Long Context, Overall Perform
1 week, 3 days ago |
www.youtube.com
Jobs in AI, ML, Big Data
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Senior Machine Learning Engineer
@ Samsara | Canada - Remote