Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/155131
Title: Addressing challenges in real-world image classification : long-tailed distribution and knowledge distillation
Authors: Wang, Yiming
Keywords: Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Issue Date: 2021
Publisher: Nanyang Technological University
Source: Wang, Y. (2021). Addressing challenges in real-world image classification : long-tailed distribution and knowledge distillation. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/155131
Abstract: In computer vision, image classification has progressed rapidly with deep learning over the past ten years. However, real-world applications still pose challenges: datasets are often highly imbalanced, and in some situations large networks cannot be deployed. From the data perspective, this thesis aims to improve data augmentation for long-tailed image classification, where a few semantic classes possess many samples while most other classes have only a few. We propose a novel Hybrid Mixup strategy to increase sample quantity and diversity, uncovering the efficacy of mixup in the latent space of StyleGAN2. Compared with traditional mixup on real images, the mixup images generated from interpolated latent codes have better quality. Experiments on CIFAR-10-LT and CIFAR-100-LT demonstrate that the proposed Hybrid Mixup consistently boosts head-, medium- and tail-class classification accuracy compared with traditional mixup on real images only; moreover, our results are on par with the state of the art and even surpass it in some settings. From the model viewpoint, we investigate knowledge distillation, which leverages large models to distill enriched knowledge into smaller ones. We focus on the scenario where teachers output only one-hot predictions, and find that students can still boost classification accuracy by learning directly from these predictions. We further propose Patched One-hot Distillation, which models an empirical probability for teachers so as to capture inter-class relationships. Experiments on CIFAR-100 and ImageNet demonstrate that the proposed method helps students learn better than a baseline that learns directly from both the ground-truth labels and the teachers' predictions.
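As a minimal sketch of the Hybrid Mixup idea described above, the standard mixup interpolation is applied to StyleGAN2 latent codes rather than to pixels. The sketch assumes a pretrained generator G that decodes a latent code into an image; latent_mixup and all parameter names are illustrative, not the thesis' actual implementation.

    import torch

    def latent_mixup(z1, z2, y1, y2, alpha=1.0):
        # Sample the mixing coefficient from a Beta distribution, as in standard mixup.
        lam = torch.distributions.Beta(alpha, alpha).sample().item()
        z_mix = lam * z1 + (1.0 - lam) * z2   # interpolate latent codes, not pixels
        y_mix = lam * y1 + (1.0 - lam) * y2   # mix the (one-hot) labels the same way
        return z_mix, y_mix

    # Usage with a pretrained StyleGAN2 generator G (hypothetical interface):
    #   z_mix, y_mix = latent_mixup(z1, z2, y1, y2)
    #   x_mix = G(z_mix)   # decode the interpolated code into a mixup image

For Patched One-hot Distillation, the abstract states only that an empirical probability is modelled for teachers to capture inter-class relationships. One plausible reading, sketched below under that assumption, is to accumulate the teacher's one-hot predictions into a class-confusion matrix and use its rows as soft targets; empirical_soft_targets and distill_loss are hypothetical helpers, not the thesis' method.

    import torch
    import torch.nn.functional as F

    def empirical_soft_targets(teacher_preds, labels, num_classes, smooth=1.0):
        # Count how often the teacher predicts class j for samples of class i,
        # with additive smoothing; row i becomes a soft target for class i.
        counts = torch.full((num_classes, num_classes), smooth)
        for pred, y in zip(teacher_preds, labels):
            counts[y, pred] += 1.0
        return counts / counts.sum(dim=1, keepdim=True)  # row-normalise to probabilities

    def distill_loss(student_logits, labels, soft_targets, T=4.0):
        # Cross-entropy to the ground truth plus a KL term pulling the student
        # toward the empirical inter-class distribution of its label's row.
        ce = F.cross_entropy(student_logits, labels)
        log_p = F.log_softmax(student_logits / T, dim=1)
        kl = F.kl_div(log_p, soft_targets[labels], reduction="batchmean")
        return ce + (T * T) * kl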
URI: https://hdl.handle.net/10356/155131
DOI: 10.32657/10356/155131
Rights: This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
Fulltext Permission: embargo_20230206
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Theses

Files in This Item:
File: Wang Yiming_thesis_SCSE(MENG)_amended.pdf
Description: Under embargo until 2023-02-06
Size: 1.45 MB
Format: Adobe PDF
