Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/183877
Title: | Detailed text-to-image generation | Authors: | Soh, Mary Hwee Choon | Keywords: | Computer and Information Science | Issue Date: | 2025 | Publisher: | Nanyang Technological University | Source: | Soh, M. H. C. (2025). Detailed text-to-image generation. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/183877 | Abstract: | Logo generation models have been assisting designers in creating aesthetically pleasing domain-specific logos. However, these models often lack the ability to produce unique designs as they typically rely on predefined templates. In contrast, popular image generation models like DALL-E and Stable Diffusion demonstrate exceptional capabilities in generating high quality images but struggle when it comes to creating coherent visual text and unique logos. Logo image-caption datasets have also been scarce, which are crucial for finetuning image generation models to improve their visual text generation capabilities. This report details the collection and processing of logo images and their captions to develop a high quality, well-annotated dataset. To evaluate the quality and effectiveness of the dataset, it is used to finetune text-to-image generation models, to assess the improvement of the finetuned models' text generation ability in images. | URI: | https://hdl.handle.net/10356/183877 | Schools: | College of Computing and Data Science | Fulltext Permission: | restricted | Fulltext Availability: | With Fulltext |
Appears in Collections: | CCDS Student Reports (FYP/IA/PA/PI) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
FYP_Amended_Final_Report_Mary_Soh.pdf Restricted Access | 37.87 MB | Adobe PDF | View/Open |
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.