[P] Rust meets Llama2: OpenAI compatible API written in Rust | allainews.com

Aug. 6, 2023, 10:06 p.m. | /u/amindiro

Machine Learning www.reddit.com

Hello,

I have been working on an OpenAI-compatible API for serving LLAMA-2 models written entirely in Rust. It supports offloading computation to Nvidia GPU and Metal acceleration for GGML models !

Here is the project link: [Cria- Local LLAMA2 API](https://github.com/AmineDiro/cria)

You can use it as an OpenAI replacement (check out the included \`Langchain\` example in the project).

This is an ongoing project, I have implemented the \`embeddings\` and \`completions\` routes. The \`chat-completion\` route will be here very soon!

Really interested …

api check computation example gpu langchain llama llama2 machinelearning metal nvidia nvidia gpu openai project replacement rust

More from www.reddit.com / Machine Learning

[P] I made a website that visualizes your codebase with LLMs 2 hours ago | www.reddit.com

codebase llms machinelearning website

[P] DARWIN - open-sourced Devin alternative 5 hours ago | www.reddit.com

access ai software ai software engineer alternative +16

[R] How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with … 8 hours ago | www.reddit.com

abstract machinelearning

[R] Curvature-Informed SGD via General Purpose Lie-Group Preconditioners 8 hours ago | www.reddit.com

abstract algorithm approximation criterion +15

[P] A look at the latest major open LLM releases: Mixtral, Llama 3, Phi-3, and … 10 hours ago | www.reddit.com

latest llama llama 3 llm +8

[D] How do unets achieve spatial consistency? 11 hours ago | www.reddit.com

convolution create denoising hair +8

[D] Impact of solar storm on QLORA + RLHF of Llama3 8B? 13 hours ago | www.reddit.com

article control current experience +13

Can one use squared inverse of KL divergence as another divergence metric? [D] 13 hours ago | www.reddit.com

divergence light machinelearning

Feeling at a loss with all these transformer models from Hugging Face in NLP "[Discussion]" 17 hours ago | www.reddit.com

classification competition essay face +14

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net