Feb. 21, 2024, 8:01 p.m. | /u/InevitableSky2801

Machine Learning www.reddit.com

A lot of comparisons out there in terms of model size and quantitative performance against benchmarks. Wanted to get people's thoughts on qualitative, ad-hoc performance.

\> Playground to Compare: [https://huggingface.co/spaces/lastmileai/gemma-playground](https://huggingface.co/spaces/lastmileai/gemma-playground)

In this example, both Gemma 2B and 7B doesn't seem to do well against Mistral for CoT tasks. Curious how it's doing for instruct, question-answering, and creativity tasks.

benchmarks creativity example gemma machinelearning mistral open models people performance quantitative question tasks terms thoughts

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US