THESIS
2023
1 online resource (xvi, 159 pages) : illustrations (some color)
Abstract
Sparsification is a natural idea for boosting the inference efficiency, training efficiency, and generalization performance of neural networks. For inference efficiency, it yields a small sparse model with far fewer parameters and much lower computational cost while preserving comparable or even better generalization performance. For training efficiency, it keeps the model small and sparse throughout the whole training process, with sparsified forward and backward propagation. For generalization performance, beyond the already effective IID (Independently and Identically Distributed) setting, we also give a novel view of utilizing sparsity to boost generalization in the OOD (Out-of-Distribution) setting. We can also sparsify the dataset to speed up the training procedure and boost OOD performance.
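To make the core idea concrete, below is a minimal sketch of magnitude-based weight sparsification in PyTorch. It is illustrative only, not the thesis's method: the function name `magnitude_prune` and the layer setup are assumptions, and it shows just the basic pattern of masking out small-magnitude weights and re-applying the mask during sparse training.

```python
import torch

def magnitude_prune(weight: torch.Tensor, sparsity: float) -> torch.Tensor:
    """Return a binary mask that keeps the largest-magnitude weights.

    `sparsity` is the fraction of weights to zero out (e.g. 0.9 keeps 10%).
    Hypothetical helper for illustration; not from the thesis.
    """
    k = int(weight.numel() * sparsity)          # number of weights to drop
    if k == 0:
        return torch.ones_like(weight)
    threshold = weight.abs().flatten().kthvalue(k).values
    return (weight.abs() > threshold).float()   # 1 = keep, 0 = prune

# Example: sparsify a linear layer, then apply the mask to its weights.
layer = torch.nn.Linear(256, 128)
mask = magnitude_prune(layer.weight.data, sparsity=0.9)
layer.weight.data *= mask

# During sparse training, the mask would be re-applied after each
# optimizer step so only the surviving weights receive updates:
#   layer.weight.data *= mask
```

In this sketch the forward pass is sparsified because the masked weights are exactly zero, and re-applying the mask after each update keeps the model size constrained throughout training, mirroring the training-efficiency setting described in the abstract.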