Mistral AI's Open-Source Mixtral 8x7B Outperforms GPT-3.5
InfoQ - AI, ML & Data Engineering (www.infoq.com)
Mistral AI recently released Mixtral 8x7B, a sparse mixture-of-experts (SMoE) large language model (LLM). Although the model contains 46.7B total parameters, its router activates only two of its eight experts per token, so each token uses roughly 12.9B parameters and inference runs at about the speed and cost of a model one-third its size. On several LLM benchmarks, Mixtral 8x7B outperformed both Llama 2 70B and GPT-3.5, the model powering ChatGPT.
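For readers unfamiliar with the SMoE pattern, the sketch below shows why top-2 routing keeps per-token compute low: every token is scored against all experts, but only the two highest-scoring expert feed-forward blocks actually run. This is a minimal, generic PyTorch illustration, not Mistral's implementation; the class name, dimensions, and router/expert structure are assumptions chosen for demonstration (only the 8-expert, top-2 configuration mirrors Mixtral's published description).

```python
# Illustrative sketch of a top-2 sparse mixture-of-experts layer (NOT Mistral's code).
# n_experts=8 and top_k=2 mirror Mixtral's described configuration; all else is assumed.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseMoE(nn.Module):
    def __init__(self, d_model: int, d_ff: int, n_experts: int = 8, top_k: int = 2):
        super().__init__()
        self.top_k = top_k
        # Router: one linear layer scoring each expert per token.
        self.router = nn.Linear(d_model, n_experts, bias=False)
        # Experts: independent feed-forward blocks of identical shape.
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (tokens, d_model)
        logits = self.router(x)                        # (tokens, n_experts)
        weights, idx = torch.topk(logits, self.top_k)  # keep only the top-2 experts
        weights = F.softmax(weights, dim=-1)           # renormalize over the chosen 2
        out = torch.zeros_like(x)
        for k in range(self.top_k):
            for e in range(len(self.experts)):
                mask = idx[:, k] == e                  # tokens whose k-th choice is expert e
                if mask.any():
                    # Only the selected experts execute, so per-token FLOPs
                    # scale with top_k, not with the total expert count.
                    out[mask] += weights[mask, k, None] * self.experts[e](x[mask])
        return out

tokens = torch.randn(4, 64)                # 4 tokens, d_model=64
layer = SparseMoE(d_model=64, d_ff=256)
print(layer(tokens).shape)                 # torch.Size([4, 64])
```

The key design point is that the total parameter count grows with the number of experts, while per-token compute grows only with the number of experts selected, which is how a 46.7B-parameter model can serve tokens at roughly 13B-model cost.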
By Anthony Alford