Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/175206
Title: Efficacy of transformers and patch augmentation in boosting stability and performance of multi-illumination white balance task
Authors: Chopra, Dhruv
Keywords: Computer and Information Science
Issue Date: 2024
Publisher: Nanyang Technological University
Source: Chopra, D. (2024). Efficacy of transformers and patch augmentation in boosting stability and performance of multi-illumination white balance task. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175206
Abstract: Color Constancy, or the ability to identify colors correctly independent of the illumination conditions, is a desirable quality for many computer vision models. Indeed, it has been demonstrated before that image classification, object detection & image segmentation models perform better on expertly White Balanced images. Thus, many approaches have been proposed to automatically correct the White Balance of images. Recently, there has been a marked interest in using Learning based methods, especially Deep Neural Networks for carrying out the White Balance Correction. In this paper, we suggest a new Patch Augmentation Strategy that improves the performance of the model on the CIEDE 2000 metric for all considered datasets. Additionally, the model trained using the Patch Augmentation Strategy achieves a better overall performance in the Multi Illumination task, outperforming the base- line on both MSE and CIEDE 2000 measures. As a secondary focus, we explore the use of a transformer backbone for enhancing performance on the White Balance Task. We discover that the Transformer model generates smoother images with lesser number of patches compared to the CNN model. However, the CNN model generates output images with a higher color fidelity and achieves better performance on all single illumination tasks. Throughout our research, we use an input resolution of 224x224x3 for all our trained models in the hopes that this would make our results more compatible with common downstream models. All of our models have been made publicly available at https://huggingface.co/DChops/White_Balance.
URI: https://hdl.handle.net/10356/175206
Schools: School of Computer Science and Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
Amended_Final_Report.pdf
  Restricted Access
Final Amended Report16.84 MBAdobe PDFView/Open

Page view(s)

184
Updated on Mar 13, 2025

Download(s)

13
Updated on Mar 13, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.