Oct. 31, 2023, 12:35 p.m. | /u/timegentlemenplease_

Machine Learning www.reddit.com

Hi /r/machinelearning! I've been working with my collaborators on a site where you can compare OpenAI models to get a sense of the improvement over time of the models: [https://theaidigest.org/progress-and-dangers](https://theaidigest.org/progress-and-dangers)

https://preview.redd.it/khruhgkp7jxb1.png?width=1960&format=png&auto=webp&s=21d13125145f7fae7351686d4078868d65cbf8c3

It includes a number of things that you might be interested in:

* You can ask any question and compare the outputs from the OpenAI models:

https://preview.redd.it/s5e9acev8jxb1.png?width=1458&format=png&auto=webp&s=0c3e5ba3661fccfc4f4ba60db346b6142b1e52f3

* Visualises OpenAI models benchmark performance across 22 benchmarks:

https://preview.redd.it/vhai63308jxb1.png?width=1948&format=png&auto=webp&s=07f65f131b2e6d5122400120a11d24205b7d08d6

* Shows examples of benchmark outputs for GPT-2 to GPT-4

https://preview.redd.it/f3p7ni068jxb1.png?width=1980&format=png&auto=webp&s=dfe25c8c4a486a0df3c4cce2e4497fd250163bd1

* …

benchmark benchmarks capabilities example examples gpt gpt-2 gpt-4 machinelearning openai openai models performance shows weapons

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne