A Simple Method to Reduce Off-chip Memory Accesses on Convolutional Neural Networks

Doyun Kim; Kyoung-Young Kim; Sanghyuck Ha; Sangsoo Ko

arXiv: 1901.09614 · v1 · pith:XDA4UZKOnew · submitted 2019-01-28 · cs.NE · cs.CV · cs.LG


keywords: memory, off-chip, accesses, algorithm, neural, convolutional, networks, process


For convolutional neural networks, a simple algorithm is proposed that reduces off-chip memory accesses by maximally utilizing the on-chip memory of a neural processing unit (NPU). In particular, the algorithm provides an effective way to process a module that consists of multiple branches and a merge layer. For Inception-V3 on Samsung's NPU in Exynos, our evaluation shows that the proposed algorithm reduces off-chip memory accesses to 1/50 and accordingly achieves a 97.59% reduction in the amount of feature-map data transferred to and from off-chip memory.
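
To make the general idea concrete, the following is a minimal sketch of how keeping intermediate feature maps in on-chip memory lowers off-chip traffic. It is not the algorithm from the paper: the `Layer` class, the `offchip_traffic` function, the byte sizes, and the rule "an output stays on-chip when the layer's input and output fit in the buffer together" are all illustrative assumptions, and the sketch models only a linear chain of layers, not a module with multiple branches and a merge layer.

```python
from dataclasses import dataclass


@dataclass
class Layer:
    name: str
    in_bytes: int   # size of the input feature map (hypothetical values)
    out_bytes: int  # size of the output feature map (hypothetical values)


def offchip_traffic(layers, onchip_capacity):
    """Count feature-map bytes moved to or from off-chip memory.

    Toy rule (an assumption, not the paper's method): a layer's output is
    kept on-chip when its input and output fit in the buffer together.
    The network input starts off-chip and the final output ends off-chip.
    """
    traffic = 0
    input_on_chip = False
    for i, layer in enumerate(layers):
        is_last = i == len(layers) - 1
        if not input_on_chip:
            traffic += layer.in_bytes   # read the input from off-chip memory
        fits = layer.in_bytes + layer.out_bytes <= onchip_capacity
        keep = fits and not is_last
        if not keep:
            traffic += layer.out_bytes  # write the output to off-chip memory
        input_on_chip = keep
    return traffic


# A hypothetical three-layer chain; each output feeds the next layer's input.
layers = [
    Layer("conv1", 100, 200),
    Layer("conv2", 200, 200),
    Layer("conv3", 200, 50),
]

baseline = offchip_traffic(layers, onchip_capacity=0)    # nothing stays on-chip
buffered = offchip_traffic(layers, onchip_capacity=400)  # everything fits
print(f"baseline={baseline} B, buffered={buffered} B, "
      f"reduction={1 - buffered / baseline:.2%}")
```

With a capacity of zero every layer reads its input and writes its output off-chip, which serves as the baseline. Once the buffer is large enough, only the first input and the last output cross the off-chip boundary.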
