Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/180240
Title: ComboVerse: compositional 3D assets creation using spatially-aware diffusion guidance
Authors: Chen, Yongwei
Wang, Tengfei
Wu, Tong
Pan, Xingang
Jia, Kui
Liu, Ziwei
Keywords: Computer and Information Science
Issue Date: 2024
Source: Chen, Y., Wang, T., Wu, T., Pan, X., Jia, K. & Liu, Z. (2024). ComboVerse: compositional 3D assets creation using spatially-aware diffusion guidance. 2024 European Conference on Computer Vision (ECCV). https://dx.doi.org/10.48550/arXiv.2403.12409
Conference: 2024 European Conference on Computer Vision (ECCV)
Abstract: Generating high-quality 3D assets from a given image is highly desirable in various applications such as AR/VR. Recent advances in single-image 3D generation explore feed-forward models that learn to infer the 3D model of an object without optimization. Though promising results have been achieved in single object generation, these methods often struggle to model complex 3D assets that inherently contain multiple objects. In this work, we present ComboVerse, a 3D generation framework that produces high-quality 3D assets with complex compositions by learning to combine multiple models. 1) We first perform an in-depth analysis of this ``multi-object gap'' from both model and data perspectives. 2) Next, with reconstructed 3D models of different objects, we seek to adjust their sizes, rotation angles, and locations to create a 3D asset that matches the given image. 3) To automate this process, we apply spatially-aware score distillation sampling (SSDS) from pretrained diffusion models to guide the positioning of objects. Our proposed framework emphasizes spatial alignment of objects, compared with standard score distillation sampling, and thus achieves more accurate results. Extensive experiments validate ComboVerse achieves clear improvements over existing methods in generating compositional 3D assets.
URI: https://hdl.handle.net/10356/180240
URL: http://arxiv.org/abs/2403.12409v1
DOI: 10.48550/arXiv.2403.12409
DOI (Related Dataset): 10.21979/N9/BAZCX6
Schools: College of Computing and Data Science 
Research Centres: S-Lab
Rights: © 2024 ECCV. All rights reserved. This article may be downloaded for personal use only. Any other use requires prior permission of the copyright holder.
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:CCDS Conference Papers

Files in This Item:
File Description SizeFormat 
ComboVerse Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance.pdfPreprint21.41 MBAdobe PDFView/Open

Page view(s)

78
Updated on Dec 3, 2024

Download(s)

17
Updated on Dec 3, 2024

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.