[D] Phi-3 models compared side-by-side. | allainews.com

May 23, 2024, 2:19 p.m. | /u/dark_surfer

Machine Learning www.reddit.com

https://preview.redd.it/8l04pnfhq62d1.png?width=661&format=png&auto=webp&s=7fe616ca8cd7da974070c86b6b47ffab3ab545e5

---------------------------------------------------------------------------------------------------------------------------------------------------


https://preview.redd.it/hr7fr1uiq62d1.png?width=688&format=png&auto=webp&s=bd3de359bfe4c1ed82d092be92ae38c246bdfda2

---------------------------------------------------------------------------------------------------------------------------------------------------


https://preview.redd.it/v6k3v39kq62d1.png?width=450&format=png&auto=webp&s=c0abb0e397a498ef7ccfb35b1b1cb598198f66ad

For anyone looking to compare the Phi-3 benchmarks in one place.

The interesting comparisons are ANLI, HellaSwag, MedQA, TriviaQA, language understanding, factual knowledge, and robustness.

Note: the Phi-3 mini model table has its labels in a different order.


