Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/184050
Title: Controllable image generation using diffusion models
Authors: Hooi, Marcus Wai Kit
Keywords: Computer and Information Science
Issue Date: 2025
Publisher: Nanyang Technological University
Source: Hooi, M. W. K. (2025). Controllable image generation using diffusion models. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184050
Project: CCDS24-0069
Abstract: This research investigates advanced control mechanisms for diffusion-based image generation models, focusing on developing a comprehensive pipeline for highly controllable outputs. The study evaluates four key control approaches: prompt engineering, ControlNet guidance, Low-Rank Adaptation (LoRA), and inpainting techniques. Through systematic experimentation with the Flux.dev diffusion model, the research demonstrates significant improvements in character consistency, pose control, and targeted image refinement. The findings reveal that while prompt engineering provides effective stylistic guidance, it faces limitations in spatial relationship control and character consistency. LoRA training offers a resource-efficient alternative to full model fine-tuning, though with slightly reduced accuracy. ControlNet integration proves particularly effective for structural and pose guidance, with different pre-processors showing distinct strengths in various applications. The research culminates in a unified pipeline that combines these mechanisms, demonstrating practical applications in content creation, professional photography, and e-commerce. The study's outcomes provide valuable insights for implementing controllable image generation in real-world applications while identifying areas for future technical enhancement and research directions.
URI: https://hdl.handle.net/10356/184050
Schools: College of Computing and Data Science 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:CCDS Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
Controllable Image Generation using Diffusion Models.pdf
  Restricted Access
2.07 MBAdobe PDFView/Open

Page view(s)

30
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.