QAPPA: Quantization-Aware Power, Performance, and Area Modeling of DNN Accelerators. (arXiv:2205.08648v1 [cs.AR])
cs.LG updates on arXiv.org
As the machine learning and systems community strives to achieve higher
energy efficiency through custom DNN accelerators and model compression
techniques, there is a need for a design space exploration framework that
incorporates quantization-aware processing elements into the accelerator design
space while having accurate and fast power, performance, and area models. In
this work, we present QAPPA, a highly parameterized quantization-aware power,
performance, and area modeling framework for DNN accelerators. Our framework
can facilitate future research on design space exploration …