all AI news
Deploy (Tiny) LLM to Production: Merge Lora Adapter, Push to HF Hub, Rest API with FastAPI & Docker
March 4, 2024, 9:30 p.m. | Venelin Valkov
Venelin Valkov www.youtube.com
You have a fine-tuned model (with LoRA adapter) to deploy as a REST API? In this video, we'll merge a LoRA adapter with a base model and upload it (with a tokenizer) to HuggingFace Hub. We'll build a REST API with FastAPI and deploy it as a Docker container.
Model on HuggingFace Hub: https://huggingface.co/curiousily/tiny-crypto-sentiment-analysis
HuggingFace Space: https://huggingface.co/spaces/curiousily/tiny-crypto-sentiment
API Docs: https://curiousily-tiny-crypto-sentiment.hf.space/docs
AI Bootcamp (in preview): https://www.mlexpert.io/membership
Discord: https://discord.gg/UaNPxVD6tv
Subscribe: http://bit.ly/venelin-subscribe
GitHub repository: https://github.com/curiousily/Get-Things-Done-with-Prompt-Engineering-and-LangChain
00:00 …
api build deploy docker docker container fastapi hub huggingface intro lora merge rest rest api text tutorial video
More from www.youtube.com / Venelin Valkov
Jobs in AI, ML, Big Data
Software Engineer for AI Training Data (School Specific)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Python)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Tier 2)
@ G2i Inc | Remote
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US