Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/184050
Title: | Controllable image generation using diffusion models | Authors: | Hooi, Marcus Wai Kit | Keywords: | Computer and Information Science | Issue Date: | 2025 | Publisher: | Nanyang Technological University | Source: | Hooi, M. W. K. (2025). Controllable image generation using diffusion models. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184050 | Project: | CCDS24-0069 | Abstract: | This research investigates advanced control mechanisms for diffusion-based image generation models, focusing on developing a comprehensive pipeline for highly controllable outputs. The study evaluates four key control approaches: prompt engineering, ControlNet guidance, Low-Rank Adaptation (LoRA), and inpainting techniques. Through systematic experimentation with the Flux.dev diffusion model, the research demonstrates significant improvements in character consistency, pose control, and targeted image refinement. The findings reveal that while prompt engineering provides effective stylistic guidance, it faces limitations in spatial relationship control and character consistency. LoRA training offers a resource-efficient alternative to full model fine-tuning, though with slightly reduced accuracy. ControlNet integration proves particularly effective for structural and pose guidance, with different pre-processors showing distinct strengths in various applications. The research culminates in a unified pipeline that combines these mechanisms, demonstrating practical applications in content creation, professional photography, and e-commerce. The study's outcomes provide valuable insights for implementing controllable image generation in real-world applications while identifying areas for future technical enhancement and research directions. | URI: | https://hdl.handle.net/10356/184050 | Schools: | College of Computing and Data Science | Fulltext Permission: | restricted | Fulltext Availability: | With Fulltext |
Appears in Collections: | CCDS Student Reports (FYP/IA/PA/PI) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Controllable Image Generation using Diffusion Models.pdf Restricted Access | 2.07 MB | Adobe PDF | View/Open |
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.