Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/183877
Title: Detailed text-to-image generation
Authors: Soh, Mary Hwee Choon
Keywords: Computer and Information Science
Issue Date: 2025
Publisher: Nanyang Technological University
Source: Soh, M. H. C. (2025). Detailed text-to-image generation. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/183877
Abstract: Logo generation models have been assisting designers in creating aesthetically pleasing domain-specific logos. However, these models often lack the ability to produce unique designs as they typically rely on predefined templates. In contrast, popular image generation models like DALL-E and Stable Diffusion demonstrate exceptional capabilities in generating high quality images but struggle when it comes to creating coherent visual text and unique logos. Logo image-caption datasets have also been scarce, which are crucial for finetuning image generation models to improve their visual text generation capabilities. This report details the collection and processing of logo images and their captions to develop a high quality, well-annotated dataset. To evaluate the quality and effectiveness of the dataset, it is used to finetune text-to-image generation models, to assess the improvement of the finetuned models' text generation ability in images.
URI: https://hdl.handle.net/10356/183877
Schools: College of Computing and Data Science 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:CCDS Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
FYP_Amended_Final_Report_Mary_Soh.pdf
  Restricted Access
37.87 MBAdobe PDFView/Open

Page view(s)

17
Updated on May 7, 2025

Download(s)

5
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.