QONNX: Representing Arbitrary-Precision Quantized Neural Networks. (arXiv:2206.07527v3 [cs.LG] UPDATED)
June 27, 2022, 1:11 a.m. | Alessandro Pappalardo, Yaman Umuroglu, Michaela Blott, Jovan Mitrevski, Ben Hawks, Nhan Tran, Vladimir Loncar, Sioni Summers, Hendrik Borras, Jules Mu
stat.ML updates on arXiv.org arxiv.org
We present extensions to the Open Neural Network Exchange (ONNX) intermediate
representation format to represent arbitrary-precision quantized neural
networks. We first introduce support for low precision quantization in existing
ONNX-based quantization formats by leveraging integer clipping, resulting in
two new backward-compatible variants: the quantized operator format with
clipping and quantize-clip-dequantize (QCDQ) format. We then introduce a novel
higher-level ONNX format called quantized ONNX (QONNX) that introduces three
new operators -- Quant, BipolarQuant, and Trunc -- in order to represent
uniform …
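The quantize-clip-dequantize (QCDQ) idea described above can be sketched as follows. This is an illustrative toy in plain Python, not the actual QONNX or ONNX operator implementations; the function name, argument names, and the specific scale/zero-point values are hypothetical.

```python
def qcdq(x, scale, zero_point, bits, signed=True):
    """Quantize x to a `bits`-bit integer with clipping, then dequantize.

    The clip step is what lets an 8-bit-oriented quantization format
    express lower precisions (e.g. 3-bit) in a backward-compatible way.
    """
    if signed:
        qmin, qmax = -(2 ** (bits - 1)), 2 ** (bits - 1) - 1
    else:
        qmin, qmax = 0, 2 ** bits - 1
    # Quantize: scale, shift by the zero point, round to nearest integer.
    q = round(x / scale) + zero_point
    # Clip: constrain to the low-precision integer range.
    q = max(qmin, min(qmax, q))
    # Dequantize: map back to the real-valued domain.
    return (q - zero_point) * scale

# Example: 3-bit signed quantization (range [-4, 3]), scale 0.5, zero point 0.
print(qcdq(1.3, 0.5, 0, 3))   # 1.3/0.5 = 2.6 -> round to 3 -> 1.5
print(qcdq(10.0, 0.5, 0, 3))  # 20 clips to 3 -> 1.5
print(qcdq(-9.0, 0.5, 0, 3))  # -18 clips to -4 -> -2.0
```

Without the clip step, the quantized values would only be bounded by the container integer type (e.g. int8), which is why clipping to the narrower range is the key to representing arbitrary low-precision quantization in existing formats.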