Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/184071
Title: Intrinsicdiff: extracting intrinsic image components from diffusion models
Authors: Li,, Zihan
Keywords: Computer and Information Science
Issue Date: 2025
Publisher: Nanyang Technological University
Source: Li, ,. Z. (2025). Intrinsicdiff: extracting intrinsic image components from diffusion models. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184071
Abstract: Intrinsic image properties such as surface normal maps, depth maps, shading, and albedo provide a decomposition of scenes that is broadly useful for image editing and scene understanding. This paper proposes an approach to intrinsic image decomposition using latent diffusion models, using the generative prior of Stable Diffusion (versions 1.5, 2.1, and 3) to learn and extract these intrinsic properties. We adapt the pre-trained diffusion model via Low-Rank Adaptation (LoRA) and an extended U-Net architecture, enabling the network to output multiple intrinsic channels with minimal additional parameters. The method is trained on the DIODE dataset , augmented with pseudo ground-truth shading and albedo to provide supervision for all intrinsic components. We attempt to explore single-step (feed-forward) strategy: while multi-step diffusion generates more detailed and accurate intrinsic outputs at the cost of increased compu- tation, single-step inference offers a faster solution with slightly reduced detail. Our results demonstrate that diffusion models can be effectively repurposed for intrinsic image decomposition, highlighting a promising avenue for generative models to yield rich intrinsic representations of scenes. Furthermore, we investigate the impact of model complexity on the quality of intrinsic decomposition.
URI: https://hdl.handle.net/10356/184071
Schools: College of Computing and Data Science 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:CCDS Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
Zihan_Li_FYP.pdf
  Restricted Access
71.63 MBAdobe PDFView/Open

Page view(s)

43
Updated on May 7, 2025

Download(s)

2
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.