THESIS
2020
viii, 35 pages : illustrations ; 30 cm
Abstract
The Winograd minimal filtering algorithm has been used in recent years to accelerate compute-bound
Convolutional Neural Networks (CNNs) on FPGAs. However, most Winograd accelerators are designed to
process convolutions efficiently only for limited filter sizes such as 3 × 3. Larger filters are
normally decomposed into small 3 × 3 tiles using a low-efficiency algorithm called overlap-and-add
(OLA) Winograd, which prevents applications that rely heavily on large filters, such as image
super-resolution or neural architecture search (NAS), from achieving further speed-up. This work
addresses the problem by proposing a novel decomposition algorithm, named nested Winograd, to
replace OLA-Winograd. We also show that the proposed algorithm can be easily integrated into
existing 3 × 3 Winograd accelerators with only slight architectural changes, by proposing a
runtime-reconfigurable Winograd accelerator that processes convolutions with arbitrary filter
sizes and strides. An FPGA implementation of nested Winograd with the Winograd kernel F(3,3)
shows a 1.42-, 1.44-, and 3.28-times throughput improvement over the FPGA implementation with
OLA-Winograd when processing the 5 × 5, 7 × 7, and 9 × 9 convolution layers of different CNN
benchmarks.
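As background to the abstract, the two standard techniques it contrasts can be illustrated in 1D. The sketch below shows the classic F(2,3) Winograd minimal filtering transform, which computes two outputs of a 3-tap convolution with 4 multiplications instead of 6, and an overlap-and-add (OLA) split of a long filter into 3-tap tiles whose shifted partial outputs are summed. This is a minimal illustration of the textbook algorithms only; the thesis's nested Winograd decomposition is not detailed in the abstract, and all function names here are illustrative.

```python
# Illustrative sketch: standard 1D Winograd F(2,3) minimal filtering and
# overlap-and-add (OLA) filter decomposition. NOT the thesis's nested
# Winograd algorithm, whose details are not given in the abstract.

def direct_conv(d, g):
    """Direct 'valid' 1D convolution (correlation form), for reference."""
    n = len(d) - len(g) + 1
    return [sum(d[i + k] * g[k] for k in range(len(g))) for i in range(n)]

def winograd_f23(d, g):
    """F(2,3): two outputs of a 3-tap filter from a 4-sample input tile,
    using 4 multiplications instead of the 6 a direct computation needs."""
    m1 = (d[0] - d[2]) * g[0]
    m2 = (d[1] + d[2]) * (g[0] + g[1] + g[2]) / 2
    m3 = (d[2] - d[1]) * (g[0] - g[1] + g[2]) / 2
    m4 = (d[1] - d[3]) * g[2]
    return [m1 + m2 + m3, m2 - m3 - m4]

def ola_conv(d, g, tile=3):
    """Overlap-and-add: split a long filter into `tile`-tap pieces,
    convolve each piece, and accumulate the shifted partial outputs."""
    n = len(d) - len(g) + 1
    y = [0.0] * n
    for off in range(0, len(g), tile):
        piece = g[off:off + tile]
        for i in range(n):
            y[i] += sum(d[i + off + k] * piece[k] for k in range(len(piece)))
    return y

d = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0]
g3 = [1.0, -2.0, 3.0]
g5 = [1.0, -2.0, 3.0, -4.0, 5.0]

print(winograd_f23(d[:4], g3))  # matches direct_conv(d[:4], g3)
print(ola_conv(d, g5))          # matches direct_conv(d, g5)
```

In an OLA-Winograd accelerator, each 3-tap (or 3 × 3) piece inside the OLA loop would itself be computed with the Winograd transform; the inefficiency the abstract refers to comes from the redundant input reads and partial-sum accumulation across tiles, which the proposed nested Winograd decomposition is designed to avoid.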