CLI tool to benchmark 100+LLMs response, response time, cost | allainews.com

Sept. 8, 2023, 4:01 p.m. | /u/Comfortable_Dirt5590

Natural Language Processing www.reddit.com

Hi r/LanguageTechnology ,I built a CLI tool to benchmark 100+ LLMs for a given question. Benchmark output allows you to compare responses, response time and cost.

Try it here: [https://github.com/BerriAI/litellm/blob/main/cookbook/benchmark/readme.md](https://github.com/BerriAI/litellm/blob/main/cookbook/benchmark/readme.md)

**CLI Output:**

|Model|Response|Response Time|Cost|
|:-|:-|:-|:-|
|gpt-3.5|As an AI language model, I cannot provide up-to-date information or predict|2.1|$0.000122|
|claude-2|I'm not able to provide information about future IPO plans or dates for BerriAI|3.2|$0.0010142|
|llama2|I do not have any information about when or if BerriAI will have an initial|0.01|$0.000522|

Simply select your LLMs, …

ai language model benchmark claude cli cost future gpt gpt-3 gpt-3.5 information language language model languagetechnology llms responses tool

More from www.reddit.com / Natural Language Processing

How big does a dataset have to be to fine-tune a transformer model for NER. 1 day, 2 hours ago | www.reddit.com

bert big database dataset +15

PhD in Linguistics: Which skills should I focus on? 1 day, 18 hours ago | www.reddit.com

communication computer computer science fields +12

Is the MA in computational linguistics that bad in Tubingen ? 2 days, 2 hours ago | www.reddit.com

computational languagetechnology linguistics

Which NLP-master programs in Europe are more cs-leaning? 5 days, 20 hours ago | www.reddit.com

computational english europe germany +12

What do you think is the state of the art technique for matching a piece … 1 week ago | www.reddit.com

art city database example +9

Multilabel text classification on unlabled data 1 week, 1 day ago | www.reddit.com

classification data finance isn +11

I made a text-game where all the LLMs trick each other pretending to be humans. … 1 week, 1 day ago | www.reddit.com

game humans languagetechnology llms +3

Help with fraud recognition 1 week, 2 days ago | www.reddit.com

bank code country detection +7

AI-proof language-related jobs in the United States? 1 week, 3 days ago | www.reddit.com

jobs language languagetechnology management +4

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net