Towards mixed image data augmentation : from optimization criteria to accumulating regularization

HKUST Electronic Theses

Towards mixed image data augmentation : from optimization criteria to accumulating regularization

by Lok Yin Lee

THESIS 2023

M.Phil. Electronic and Computer Engineering

1 online resource (xii, 60 pages) : illustrations (some color)

Abstract

Data augmentation effectively regularizes deep learning models with less effort than obtaining additional data. Recently, many Mixed Sample Data Augmentation (MSDA) methods have been proposed, mixing two or more images into one augmented image. These mixing strategies generalize networks significantly better than traditional augmentation methods like cropping and flipping. They can improve models’ robustness against corrupted data and capabilities in downstream tasks such as object detection. However, most of these methods still have label mismatching problems, and the improvement criteria must be clarified.

This thesis introduces a solution for the label mismatching problem, an experiment on improvement criteria MSDA, and a compelling image mixing strategy, RandMix. The proposed Semantic Proportional Label Generation (SPLG) procedure generates the accurate label after mixing. SPLG is a generic method applicable to data mixing or wrapping techniques. Second, improvement criteria are proposed with the experiment to investigate their effect. From the three commonly known criteria (Data Diversity, Saliency Information, and Local Smoothness), we experimented and observed that data diversity is more critical to standard MSDA methods. Third, a practical mixing strategy RandMix is proposed. It recursively applies augmentations (MixUp, SaliencyMix, or ResizeMix) with a semantic proportional label generation procedure. RandMix provides significant improvement with negligible computational cost by accumulating the regularization effects. RandMix outperforms other CIFAR and ImageNet classification strategies and, surpasses the popular augmentation strategies on object detection tasks. Moreover, RandMix can combine with other traditional augmentation strategies (such as RandAugment) to obtain the highest model’s robustness against input corruption. Based on these extensive experiments, this thesis shows the effectiveness of applying MSDA techniques with a dedicated application strategy and demonstrates the benefit of MSDA.

[ Hide abstract ]

View Copyrighted to the author. Reproduction is prohibited without the author’s prior written consent.

Details

Collection HKUST Electronic Theses Degree M.Phil. Department Electronic and Computer Engineering Supervisors Shi, Bertram E. Woo, Kam Tim Authors Lee, Lok Yin Language English Call number Thesis ECE 2023 LeeL DOI 10.14711/thesis-991013181658903412

Full record

Towards mixed image data augmentation : from optimization criteria to accumulating regularization

by Lok Yin Lee

Post a Comment Cancel reply