Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/183977
Title: | Automatic image colorization | Authors: | Teng, Royce | Keywords: | Computer and Information Science | Issue Date: | 2025 | Publisher: | Nanyang Technological University | Source: | Teng, R. (2025). Automatic image colorization. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/183977 | Project: | CCDS24-0519 | Abstract: | Recent advancements in image colorization models—both unconditional and conditional— have achieved remarkable results, generating rich and vibrant colours. These models also allows users to guide the colorization process through input conditions such as text prompts, brush strokes, reference exemplar images, or combinations of them. Colorization has seen significantly improved output quality and color richness through breakthroughs in deep learning architectures, such as Transformers, GANs, and Stable Diffusion. However, most of these research prioritized enhancing the quality of colorization over improving inference speed. While large models like stable diffusion produce impressive results, their slowprocessing speeds remain a key limitation. In this paper, we propose a novel framework that utilizes ControlNet architecture for image colorization and then perform model distillation on it. ControlNet is trained for both unconditional and prompt-based colorization, using LLaVA to generate image prompts. The trained ControlNet model serves as a teacher in the model distillation pipeline, training a smaller, faster student model designed to deliver significantly quicker outputs with negligible performance degradation. We conduct extensive experiments on the ImageNet and COCO-stuff datasets to evaluate the proposed approach against stateof- the-art models for both unconditional and prompt-based colorization. This work aims to advance the field by offering an approach that combines vivid and vibrant colorization with fast inference speed, paving the way for more practical and efficient applications. | URI: | https://hdl.handle.net/10356/183977 | Schools: | College of Computing and Data Science | Fulltext Permission: | restricted | Fulltext Availability: | With Fulltext |
Appears in Collections: | CCDS Student Reports (FYP/IA/PA/PI) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
NTU_CCDS_FYP_Template (2).pdf Restricted Access | 16.11 MB | Adobe PDF | View/Open |
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.