Aug. 27, 2022, 12:12 p.m. | /u/maekawatoshiki

Machine Learning www.reddit.com

I ran a gemm kernel using OpenBLAS and other famous libraries for linear algebra, but it is always slower than running GEMM op on ONNX Runtime (on CPU).

What kind of magic is used in ONNX Runtime to greatly improve gemm performance?

machinelearning onnx

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

AI Scientist/Engineer

@ OKX | Singapore

Research Engineering/ Scientist Associate I

@ The University of Texas at Austin | AUSTIN, TX

Senior Data Engineer

@ Algolia | London, England

Fundamental Equities - Vice President, Equity Quant Research Analyst (Income & Value Investment Team)

@ BlackRock | NY7 - 50 Hudson Yards, New York

Snowflake Data Analytics

@ Devoteam | Madrid, Spain