Deploy (Tiny) LLM to Production: Merge Lora Adapter, Push to HF Hub, Rest API with FastAPI & Docker | allainews.com

March 4, 2024, 9:30 p.m. | Venelin Valkov

Venelin Valkov www.youtube.com

Full text tutorial (requires MLExpert Pro): https://www.mlexpert.io/bootcamp/deploy-custom-llm-to-production

You have a fine-tuned model (with LoRA adapter) to deploy as a REST API? In this video, we'll merge a LoRA adapter with a base model and upload it (with a tokenizer) to HuggingFace Hub. We'll build a REST API with FastAPI and deploy it as a Docker container.

Model on HuggingFace Hub: https://huggingface.co/curiousily/tiny-crypto-sentiment-analysis
HuggingFace Space: https://huggingface.co/spaces/curiousily/tiny-crypto-sentiment
API Docs: https://curiousily-tiny-crypto-sentiment.hf.space/docs

AI Bootcamp (in preview): https://www.mlexpert.io/membership
Discord: https://discord.gg/UaNPxVD6tv
Subscribe: http://bit.ly/venelin-subscribe
GitHub repository: https://github.com/curiousily/Get-Things-Done-with-Prompt-Engineering-and-LangChain

00:00 …

api build deploy docker docker container fastapi hub huggingface intro lora merge rest rest api text tutorial video

More from www.youtube.com / Venelin Valkov

Run Your Own AI (Mixtral) on Your Machine - Inference using Llamacpp on a Cloud … 2 weeks, 1 day ago | www.youtube.com

ai system cloud control cpp +18

Build Real-World Machine Learning Project: Step-by-Step Guide using FastAPI, DVC & Poetry 3 weeks ago | www.youtube.com

api build building data +17

Grok-1 Open Source: 314B Mixture-of-Experts Model by xAI | Blog post, GitHub/Source Code 1 month, 1 week ago | www.youtube.com

architecture blog code experts +8

Real-World PyTorch: From Zero to Hero in Deep Learning & LLMs | Tensors, Operations, Model … 1 month, 1 week ago | www.youtube.com

advanced basics data deep learning +19

Will AI Take Your Job? Should You Learn Programming and AI/ML Development in 2024 and … 1 month, 1 week ago | www.youtube.com

artificialintelligence beyond chatgpt development +12

Deploy (Tiny) LLM to Production: Merge Lora Adapter, Push to HF Hub, Rest API with … 1 month, 3 weeks ago | www.youtube.com

api build deploy docker +12

Fine-tuning Tiny LLM on Your Data | Sentiment Analysis with TinyLlama and LoRA on a … 3 months ago | www.youtube.com

analysis data dataset fine-tuning +15

Mamba vs. Transformers: The Future of LLMs? | Paper Overview & Google Colab Code & … 3 months, 3 weeks ago | www.youtube.com

architecture chat code colab +18

Key Principles for Optimizing LLaMA 2 & ChatGPT Responses | Mastering AI Prompt Engineering 3 months, 3 weeks ago | www.youtube.com

breaking chatgpt chatgpt responses engineering +12

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Research Scientist (Computer Science)

@ Nanyang Technological University | NTU Main Campus, Singapore

View on ai-jobs.net

Intern - Sales Data Management

@ Deliveroo | Dubai, UAE (Main Office)

View on ai-jobs.net