THESIS
2021
1 online resource (xv, 46 pages) : illustrations (some color)
Abstract
Image classification is a fundamental problem in computer vision. Although deep neural networks can surpass human vision in classifying images, most of these networks require expensive computational resources.
In this thesis, a neural network layer based on the Spatial Transformer Network is proposed to improve the efficiency of neural networks. We call this structure the “spatial transform bottleneck”, by analogy to the widely used bottleneck layer in ResNet: where the ResNet bottleneck reduces the channel dimension, the proposed structure applies a spatial transformation to reduce the spatial dimension. The proposed layer performs three operations on the input feature maps. First, a spatial transformer resamples the input features into a lower-dimensional space. Then, features in the lower-dimensional space are extracted by a sequence of computationally expensive operations. Finally, an inverse spatial transformer restores the result to the original space. By performing the key operations in a lower-dimensional space, the efficiency of the network is improved.
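The three-step layer described above (resample down, compute, resample back) can be sketched in NumPy. This is a simplified, hypothetical illustration, not the thesis's implementation: it uses a single-channel 2-D feature map, a fixed identity warp `theta` in place of a learned localization network, and a placeholder `body` for the expensive operations.

```python
import numpy as np

def affine_grid(theta, out_h, out_w):
    """Build a sampling grid: for each output pixel (in normalized
    [-1, 1] coordinates), compute its source location in the input."""
    ys, xs = np.meshgrid(np.linspace(-1, 1, out_h),
                         np.linspace(-1, 1, out_w), indexing="ij")
    coords = np.stack([xs, ys, np.ones_like(xs)], axis=-1)  # (out_h, out_w, 3)
    return coords @ theta.T                                  # (out_h, out_w, 2)

def bilinear_sample(feat, grid):
    """Sample a 2-D feature map at the normalized grid locations."""
    H, W = feat.shape
    x = (grid[..., 0] + 1) * (W - 1) / 2        # to pixel coordinates
    y = (grid[..., 1] + 1) * (H - 1) / 2
    x0 = np.clip(np.floor(x).astype(int), 0, W - 2)
    y0 = np.clip(np.floor(y).astype(int), 0, H - 2)
    wx, wy = x - x0, y - y0
    return (feat[y0, x0]         * (1 - wx) * (1 - wy)
          + feat[y0, x0 + 1]     * wx       * (1 - wy)
          + feat[y0 + 1, x0]     * (1 - wx) * wy
          + feat[y0 + 1, x0 + 1] * wx       * wy)

def spatial_transform_bottleneck(feat, theta, scale=2, body=lambda z: z):
    """1) spatial transformer resamples input into a smaller space,
       2) the expensive `body` runs there at reduced cost,
       3) the inverse transformer restores the original space."""
    H, W = feat.shape
    small = bilinear_sample(feat, affine_grid(theta, H // scale, W // scale))
    small = body(small)
    # Invert the 2x3 affine map by lifting it to a 3x3 homogeneous matrix.
    theta_inv = np.linalg.inv(np.vstack([theta, [0, 0, 1]]))[:2]
    return bilinear_sample(small, affine_grid(theta_inv, H, W))

feat = np.random.rand(16, 16)
theta = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])  # identity warp
out = spatial_transform_bottleneck(feat, theta, scale=2)
print(out.shape)  # (16, 16): original resolution restored
```

With `scale=2`, `body` sees a feature map with a quarter of the spatial locations, which is where the efficiency gain comes from; the bilinear down/up resampling is lossy, so the restored map is an approximation of computing `body` at full resolution.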
Furthermore, by using spatial transformers with a lightweight localization network, the network can either compute the whole feature maps at a lower resolution or compute local features at a relatively high resolution. To minimize the overhead of the localization network, we propose to construct it with a Transformer instead of the widely used Convolutional Neural Network.
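A Transformer-based localization network's job is to map the input feature map to the six affine parameters theta. The toy NumPy sketch below is a hypothetical stand-in for the thesis's design: it tokenizes the feature map into patches, applies one single-head self-attention layer, mean-pools, and predicts theta through a linear output layer that is zero-initialized with an identity bias (a common Spatial Transformer trick, so the warp starts as a no-op).

```python
import numpy as np

rng = np.random.default_rng(0)

def self_attention(X, Wq, Wk, Wv):
    """Single-head scaled dot-product attention over the token rows of X."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[1])
    A = np.exp(scores - scores.max(axis=1, keepdims=True))
    A /= A.sum(axis=1, keepdims=True)   # row-wise softmax
    return A @ V

def transformer_localization(feat, params, patch=4):
    """Tokenize the feature map into patches, run one attention layer,
    mean-pool the tokens, and predict the 2x3 affine parameters theta."""
    H, W = feat.shape
    tokens = (feat.reshape(H // patch, patch, W // patch, patch)
                  .transpose(0, 2, 1, 3)
                  .reshape(-1, patch * patch))   # (num_tokens, patch*patch)
    Wq, Wk, Wv, W_out, b_out = params
    h = self_attention(tokens, Wq, Wk, Wv).mean(axis=0)
    return (W_out @ h + b_out).reshape(2, 3)

d = 16  # token dimension (= patch * patch here)
params = (rng.normal(size=(d, d)) * 0.1,          # Wq
          rng.normal(size=(d, d)) * 0.1,          # Wk
          rng.normal(size=(d, d)) * 0.1,          # Wv
          np.zeros((6, d)),                       # W_out zero-initialized ...
          np.array([1., 0., 0., 0., 1., 0.]))     # ... so theta starts at identity
theta = transformer_localization(rng.random((16, 16)), params)
print(theta)  # [[1. 0. 0.]
              #  [0. 1. 0.]]
```

Because attention pools globally over a short token sequence, such a head can stay small regardless of input resolution, which is consistent with the overhead argument made above; the thesis's actual architecture may differ.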
To illustrate the effectiveness of the proposed spatial transform bottleneck, we apply it to ResNet for image classification. Depending on the configuration, a ResNet using the spatial transform bottleneck reduces operations by up to 42% while achieving performance comparable to the baselines.
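To see where savings of this magnitude can come from, consider back-of-the-envelope FLOP arithmetic for a convolution: its cost scales with the number of spatial locations, so halving the resolution inside the bottleneck cuts the wrapped layers' cost by 4x. The layer sizes below are illustrative, not the thesis's configuration.

```python
def conv_flops(h, w, c_in, c_out, k=3):
    """Approximate FLOPs for one k x k convolution: two FLOPs
    (multiply + add) per output element per kernel weight."""
    return 2 * h * w * c_in * c_out * k * k

full = conv_flops(56, 56, 256, 256)   # at full resolution
half = conv_flops(28, 28, 256, 256)   # inside the bottleneck, at 1/2 resolution
print(f"reduction inside the bottleneck: {1 - half / full:.0%}")  # 75%
```

The whole-network saving is smaller than this per-layer figure, since only the wrapped layers run at reduced resolution and the spatial transformers add some overhead; for example, if roughly 56% of a baseline's FLOPs sat in wrapped layers, a 75% per-layer cut would yield about a 42% overall reduction, the same order as the reported number.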