Retraining-free Model Quantization via One-Shot Weight-Coupling Learning

June 17, 2024, 4:47 a.m. | Chen Tang, Yuan Meng, Jiacheng Jiang, Shuzhao Xie, Rongwei Lu, Xinzhu Ma, Zhi Wang, Wenwu Zhu

cs.CV updates on arXiv.org

arXiv:2401.01543v2 Announce Type: replace
Abstract: Quantization is important for compressing over-parameterized deep neural models and deploying them on resource-limited devices. Fixed-precision quantization suffers a performance drop because of its limited numerical representation ability. In contrast, mixed-precision quantization (MPQ) compresses the model effectively by allocating heterogeneous bit-widths to layers. MPQ is typically organized as a two-stage searching-retraining process. In this paper, we devise a one-shot training-searching paradigm for mixed-precision model compression. Specifically, in the first stage, all …
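
To make the contrast between fixed-precision and mixed-precision quantization concrete, here is a minimal Python sketch. It is not the paper's method: it assumes plain symmetric uniform quantization, and the helper names (`quantize_uniform`, `model_size_bits`, `total_squared_error`), the two toy layers, and both bit-width policies are hypothetical choices made only for illustration.

```python
import numpy as np


def quantize_uniform(weights, bits):
    """Fake-quantize an array with symmetric uniform quantization.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if bits < 2:
        raise ValueError("bits must be at least 2 for a symmetric signed grid")
    qmax = 2 ** (bits - 1) - 1           # largest integer level, e.g. 127 for 8 bits
    max_abs = float(np.max(np.abs(weights)))
    if max_abs == 0.0:
        return weights.copy()            # all-zero tensor: nothing to quantize
    scale = max_abs / qmax               # step size between adjacent levels
    levels = np.clip(np.round(weights / scale), -qmax, qmax)
    return levels * scale                # map integer levels back to floats


def model_size_bits(layers, bit_widths):
    """Storage cost in bits when each layer uses its own bit-width."""
    return sum(w.size * bit_widths[name] for name, w in layers.items())


def total_squared_error(layers, bit_widths):
    """Sum of squared quantization errors over all layers."""
    return sum(
        float(np.sum((w - quantize_uniform(w, bit_widths[name])) ** 2))
        for name, w in layers.items()
    )


# Toy "model": random weights standing in for two layers (assumed shapes).
rng = np.random.default_rng(0)
layers = {
    "conv1": rng.normal(size=(16, 27)),
    "fc": rng.normal(size=(10, 256)),
}

fixed = {"conv1": 4, "fc": 4}   # fixed precision: same bit-width everywhere
mixed = {"conv1": 8, "fc": 3}   # mixed precision: one bit-width per layer

for label, policy in (("fixed", fixed), ("mixed", mixed)):
    size = model_size_bits(layers, policy)
    err = total_squared_error(layers, policy)
    print(f"{label}: {size} bits, squared error {err:.3f}")
```

A per-layer policy such as `mixed` is the object that an MPQ search would choose. The sketch only evaluates two hand-picked policies and does not perform any search or training.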
