[D] Gemma vs Mistral (and other open models) | allainews.com

Feb. 21, 2024, 8:01 p.m. | /u/InevitableSky2801

Machine Learning www.reddit.com

A lot of comparisons out there in terms of model size and quantitative performance against benchmarks. Wanted to get people's thoughts on qualitative, ad-hoc performance.

\> Playground to Compare: [https://huggingface.co/spaces/lastmileai/gemma-playground](https://huggingface.co/spaces/lastmileai/gemma-playground)

In this example, both Gemma 2B and 7B doesn't seem to do well against Mistral for CoT tasks. Curious how it's doing for instruct, question-answering, and creativity tasks.

benchmarks creativity example gemma machinelearning mistral open models people performance quantitative question tasks terms thoughts

More from www.reddit.com / Machine Learning

[D] How did OpenAI go from doing exciting research to a big-tech-like company? an hour ago | www.reddit.com

capabilities engineering fast forward gpt4 +6

[D] Culture of Recycling Old Conference Submissions in ML 4 hours ago | www.reddit.com

conference conferences culture iclr +10

[D] How Do You Efficiently Conduct Ablation Studies in Machine Learning? 4 hours ago | www.reddit.com

fine-tuning grid insights machine +7

[P] N-way-attention 8 hours ago | www.reddit.com

algorithm attention concept every +12

[D] Is it possible to train ViTMAE with Hyperspectral Satellite Images? 18 hours ago | www.reddit.com

encoder format images learn +4

[D] Mamba Convergence speed 21 hours ago | www.reddit.com

class convergence dataset example +10

[P] Local RAG with RETSim, Ollama and Gemma 23 hours ago | www.reddit.com

gemma machinelearning notebooks ollama +3

[Project] Tabletop HandyBot: low-cost robotic arm assistant for tabletop tasks 1 day, 1 hour ago | www.reddit.com

arm assistant cost functional +9

[R] Grounding DINO 1.5 Release: the most capable open-set detection model 1 day, 2 hours ago | www.reddit.com

building dataset detection foundation +12

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net