[D] Why is ONNX Runtime so fast?

Aug. 27, 2022, 12:12 p.m. | /u/maekawatoshiki

I ran a gemm kernel using OpenBLAS and other famous libraries for linear algebra, but it is always slower than running GEMM op on ONNX Runtime (on CPU).

What kind of magic is used in ONNX Runtime to greatly improve gemm performance?

machinelearning onnx

Visit resource

More from www.reddit.com / Machine Learning

[D] Llama-3 (7B and 70B) on a medical domain benchmark 10 hours ago | www.reddit.com

70b ai community benchmark community +10

[D] ICML Meta Reviews 11 hours ago | www.reddit.com

machinelearning

[R] Show Your Work with Confidence: Confidence Bands for Tuning Curves 12 hours ago | www.reddit.com

abstract accounting function hyperparameter +11

[R] InternVL v1.5 open sourced, ranking first in OpenCompass multi-modal benchmark 12 hours ago | www.reddit.com

benchmark cvpr demo download +7

[N] Meta releases Llama 3 12 hours ago | www.reddit.com

machinelearning

[R] MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control 12 hours ago | www.reddit.com

abstract agent design diverse +12

[R] Compression Represents Intelligence Linearly 13 hours ago | www.reddit.com

abstract advanced belief compression +13

[D] Product evaluations is one of the most under-discussed topics 13 hours ago | www.reddit.com

ai consultancy cases client consultancy +8

[D] 100+ labels text-classification problem. Whats the “usual” approach? Transformers? 14 hours ago | www.reddit.com

boosting classification data ensemble +10

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

View on ai-jobs.net

AI Scientist/Engineer

@ OKX | Singapore

View on ai-jobs.net

Research Engineering/ Scientist Associate I

@ The University of Texas at Austin | AUSTIN, TX

View on ai-jobs.net

Senior Data Engineer

@ Algolia | London, England

View on ai-jobs.net

Fundamental Equities - Vice President, Equity Quant Research Analyst (Income & Value Investment Team)

@ BlackRock | NY7 - 50 Hudson Yards, New York

View on ai-jobs.net

Snowflake Data Analytics

@ Devoteam | Madrid, Spain

View on ai-jobs.net

View more jobs

all AI news

[D] Why is ONNX Runtime so fast?

More from www.reddit.com / Machine Learning

Jobs in AI, ML, Big Data

Data Scientist (m/f/x/d)

AI Scientist/Engineer

Research Engineering/ Scientist Associate I

Senior Data Engineer

Fundamental Equities - Vice President, Equity Quant Research Analyst (Income & Value Investment Team)

Snowflake Data Analytics