Transform Quantization for CNN Compression

Virtual: https://events.vtools.ieee.org/m/274143

In this work, we compress convolutional neural network (CNN) weights post-training via transform quantization. Previous CNN quantization techniques tend to ignore the joint statistics of weights and activations, producing sub-optimal...