THESIS
2021
1 online resource (xii, 131 pages) : illustrations (some color)
Abstract
Deep Convolutional Neural Networks (CNNs) have achieved substantial advances in a wide range of vision tasks. However, the superior performance of CNNs usually requires powerful hardware with abundant computation and memory resources. As the demand to run vision tasks on mobile devices grows, the limited storage and computing power of these devices prevents high-performance models from being widely deployed. To bridge this gap, we are motivated to compress neural networks so that they can be deployed on mobile devices.
In general, there are four major approaches for neural network compression. The first is designing a more compact neural network, manually or through neural architecture search (NAS). The second is quantizing the weights and activations in the neural network. The third is pruning the redundant channels, and the fourth is distilling the knowledge in an over-parameterized teacher network to a compact student network.
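As a concrete illustration of the second approach, the sketch below shows a common baseline scheme for quantizing weights: symmetric uniform quantization with a single per-tensor scale. This is a generic example for exposition, not the specific algorithm studied in the dissertation.

```python
import numpy as np

def uniform_quantize(x, bits=8):
    """Quantize a float tensor to signed integers using a symmetric
    per-tensor scale (a common baseline quantization scheme)."""
    qmax = 2 ** (bits - 1) - 1              # e.g. 127 for 8 bits
    max_abs = np.abs(x).max()
    scale = max_abs / qmax if max_abs > 0 else 1.0
    q = np.clip(np.round(x / scale), -qmax, qmax).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Map quantized integers back to approximate float values."""
    return q.astype(np.float32) * scale

w = np.array([0.5, -1.27, 0.03, 1.0], dtype=np.float32)
q, s = uniform_quantize(w)
w_hat = dequantize(q, s)
```

Storing `q` (8-bit integers) instead of `w` (32-bit floats) cuts weight storage by 4x, at the cost of a small reconstruction error bounded by half the scale.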
In this dissertation, we focus on improving the algorithms behind specific compression methods, or combinations of them. We first propose Bi-Real Net to enhance the accuracy of binary neural networks, i.e., the extreme case of quantization in which weights and activations are constrained to one bit. Building on this binarization algorithm, we apply knowledge distillation to further improve accuracy, and we investigate optimization strategies tailored to binarized networks. We then use neural architecture search to find architectures that are well suited to binarization, and extend the same search algorithm to discover good pruning schemes for channel pruning. Lastly, we study the dependence of neural architecture search algorithms on the training data.
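The core operation in a binary neural network is binarizing real values to {-1, +1} with a sign function, whose zero gradient is replaced by a surrogate during backpropagation. The sketch below uses the generic straight-through estimator for illustration; the actual Bi-Real Net work refines both the forward shortcut structure and the backward approximation of the sign function, so this is only a simplified stand-in.

```python
import numpy as np

def binarize_forward(x):
    """Forward pass: sign(x) in {-1, +1}.
    Zero is mapped to +1 by convention."""
    return np.where(x >= 0, 1.0, -1.0)

def binarize_backward(x, grad_out):
    """Backward pass: straight-through estimator.
    The true gradient of sign() is zero almost everywhere, so the
    incoming gradient is passed through unchanged where |x| <= 1
    and zeroed elsewhere (a clipped identity surrogate)."""
    return grad_out * (np.abs(x) <= 1.0).astype(np.float32)

x = np.array([-0.5, 0.7, 0.0, 2.3], dtype=np.float32)
b = binarize_forward(x)          # binary activations in {-1, +1}
g = binarize_backward(x, np.ones_like(x))  # surrogate gradients
```

Because each binarized weight needs only one bit and multiplications reduce to XNOR operations, binary networks offer large savings in both memory and computation, which is what makes improving their accuracy worthwhile.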