How to Serve LLM Completions in Production
Jan. 18, 2024, 9:19 p.m. | Mateusz Charytoniuk
DEV Community dev.to
Preparations
To start, you need to compile llama.cpp. You can follow their README for instructions.
The server is compiled alongside other targets by default.
Once the server is running, we can continue. We will use the PHP Resonance framework.
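The build steps above can be sketched as follows. This is a minimal sketch assuming the upstream llama.cpp repository URL and the server binary name used in releases from this period; the model path is a placeholder for whatever GGUF file you end up with.

```shell
# Clone and build llama.cpp; the HTTP server is compiled
# alongside the other targets by default
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
make

# Start the built-in HTTP server with a GGUF model
# (the model path is a placeholder; adjust host/port to taste)
./server -m ./models/mistral-7b.Q4_K_M.gguf --host 127.0.0.1 --port 8080
```

With the defaults above, the server exposes an HTTP completion endpoint at http://127.0.0.1:8080.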
Troubleshooting
Obtaining Open-Source LLM
I recommend starting with either Llama 2 or Mistral. You need to download the pretrained weights and convert them to the GGUF format before llama.cpp can use them.
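A hedged sketch of the conversion step, assuming a llama.cpp checkout from this period (which shipped a `convert.py` script and a `quantize` binary); the weights directory is a placeholder for wherever you downloaded the model:

```shell
# Convert downloaded weights to GGUF (run from the llama.cpp checkout);
# /path/to/mistral-7b is a placeholder for your weights directory
python3 convert.py /path/to/mistral-7b --outfile mistral-7b-f16.gguf

# Optionally quantize to reduce memory usage;
# Q4_K_M is a common quality/size trade-off
./quantize mistral-7b-f16.gguf mistral-7b.Q4_K_M.gguf Q4_K_M
```

Quantizing is optional, but a 4-bit model is far easier to serve on commodity hardware than the full 16-bit weights.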
Starting Server Without a GPU
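For CPU-only inference, the main knobs are the thread count and the context size. A minimal sketch, assuming the server binary and flags from llama.cpp builds of this period; the values are illustrative, not tuned:

```shell
# CPU-only run: pin threads to the number of available cores
THREADS=$(nproc)

# -t sets the thread count, -c the context window in tokens;
# tune both for your machine (the model path is a placeholder)
./server -m mistral-7b.Q4_K_M.gguf -t "$THREADS" -c 2048 \
  --host 127.0.0.1 --port 8080

# Smoke-test the completion endpoint once the server is up
curl -s http://127.0.0.1:8080/completion \
  -H 'Content-Type: application/json' \
  -d '{"prompt": "Hello,", "n_predict": 16}'
```

Without a GPU, expect noticeably lower throughput; a quantized model and a modest context size keep latency tolerable.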