March 11, 2024, 2:02 p.m. | Marie Stephen Leo

Towards AI - Medium pub.towardsai.net

Reduce LLM latency and cost by over 95% using OpenAI, LiteLLM, Qdrant, and Sentence Transformers!

ai chatbots artificial intelligence caching chatbots cost data science generative generative ai chatbots latency llm machine learning openai programming qdrant reading reduce semantic technology transformers

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York