Hardware-oriented Approximation of Convolutional Neural Networks

Mohammad Motamedi; Philipp Gysel; Soheil Ghiasi

arxiv: 1604.03168 · v3 · pith:F5JXDY5Jnew · submitted 2016-04-11 · 💻 cs.CV

Hardware-oriented Approximation of Convolutional Neural Networks

Philipp Gysel , Mohammad Motamedi , Soheil Ghiasi This is my paper

classification 💻 cs.CV

keywords ristrettoapproximationconvolutionalpointcondensefixedgivenhardware-oriented

0 comments

read the original abstract

High computational complexity hinders the widespread usage of Convolutional Neural Networks (CNNs), especially in mobile devices. Hardware accelerators are arguably the most promising approach for reducing both execution time and power consumption. One of the most important steps in accelerator development is hardware-oriented model approximation. In this paper we present Ristretto, a model approximation framework that analyzes a given CNN with respect to numerical resolution used in representing weights and outputs of convolutional and fully connected layers. Ristretto can condense models by using fixed point arithmetic and representation instead of floating point. Moreover, Ristretto fine-tunes the resulting fixed point network. Given a maximum error tolerance of 1%, Ristretto can successfully condense CaffeNet and SqueezeNet to 8-bit. The code for Ristretto is available.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Evolutionary fine tuning of quantized convolution-based deep learning models
cs.LG 2026-04 unverdicted novelty 5.0

Evolutionary fine-tuning of select weights in pre-quantized convolutional networks improves accuracy over standard rounding for VGG, ResNet, and autoencoder models.
FusionAccel: A General Re-configurable Deep Learning Inference Accelerator on FPGA for Convolutional Neural Networks
cs.AR 2019-07 unverdicted novelty 4.0

FusionAccel is a scalable, runtime-reconfigurable RTL CNN inference accelerator implemented and verified on Xilinx Spartan-6 FPGA with results identical to Caffe-CPU and designed for ASIC migration.