Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/180233
Title: StructLDM: structured latent diffusion for 3D human generation
Authors: Hu, Tao
Hong, Fangzhou
Liu, Ziwei
Keywords: Computer and Information Science
Issue Date: 2024
Source: Hu, T., Hong, F. & Liu, Z. (2024). StructLDM: structured latent diffusion for 3D human generation. 2024 European Conference on Computer Vision (ECCV). https://dx.doi.org/10.48550/arXiv.2404.01241
Project: MOET2EP20221- 0012
NTU-NAP
IAF-ICP
RIE2020
Conference: 2024 European Conference on Computer Vision (ECCV)
Abstract: Recent 3D human generative models have achieved remarkable progress by learning 3D-aware GANs from 2D images. However, existing 3D human generative methods model humans in a compact 1D latent space, ignoring the articulated structure and semantics of human body topology. In this paper, we explore more expressive and higher-dimensional latent space for 3D human modeling and propose StructLDM, a diffusion-based unconditional 3D human generative model, which is learned from 2D images. StructLDM solves the challenges imposed due to the high-dimensional growth of latent space with three key designs: 1) A semantic structured latent space defined on the dense surface manifold of a statistical human body template. 2) A structured 3D-aware auto-decoder that factorizes the global latent space into several semantic body parts parameterized by a set of conditional structured local NeRFs anchored to the body template, which embeds the properties learned from the 2D training data and can be decoded to render view-consistent humans under different poses and clothing styles. 3) A structured latent diffusion model for generative human appearance sampling. Extensive experiments validate StructLDM's state-of-the-art generation performance and illustrate the expressiveness of the structured latent space over the well-adopted 1D latent space. Notably, StructLDM enables different levels of controllable 3D human generation and editing, including pose/view/shape control, and high-level tasks including compositional generations, part-aware clothing editing, 3D virtual try-on, etc. Our project page is at: https://taohuumd.github.io/projects/StructLDM/.
URI: https://hdl.handle.net/10356/180233
URL: http://arxiv.org/abs/2404.01241v3
DOI: 10.48550/arXiv.2404.01241
DOI (Related Dataset): 10.21979/N9/BXUEXV
Schools: College of Computing and Data Science 
Research Centres: S-Lab
Rights: © 2024 ECCV. All rights reserved. This article may be downloaded for personal use only. Any other use requires prior permission of the copyright holder.
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:CCDS Conference Papers

Files in This Item:
File Description SizeFormat 
StructLDM Structured Latent Diffusion for 3D Human Generation.pdfPreprint37.77 MBAdobe PDFView/Open

Page view(s)

80
Updated on Dec 4, 2024

Download(s)

6
Updated on Dec 4, 2024

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.