Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/184371
Title: Unsupervised object segmentation with slot attention: validation and challenges on real-world data
Authors: Wang, Xuan
Keywords: Engineering
Issue Date: 2025
Publisher: Nanyang Technological University
Source: Wang, X. (2025). Unsupervised object segmentation with slot attention: validation and challenges on real-world data. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184371
Abstract: Traditional deep learning methods often have difficulty preserving object-level semantics because they usually encode the entire scene into a single holistic representation. Object-centric learning (OCL) addresses this limitation by decomposing the scene into individual object representations, making the model more interpretable and composable. Among OCL methods, slot attention shows promising results on synthetic datasets, but its generalization to real-world data remains underexplored. This study investigates the segmentation performance of slot attention in both synthetic and natural environments. We first validate its effectiveness on the CLEVR6 dataset, confirming its ability to learn object-centric representations in structured environments. Building on this, we train the model on the COCO dataset to evaluate its performance on complex real-world scenes. By comparing the results on different datasets, we reveal key challenges such as background clutter, object occlusion, and slot competition, and propose corresponding potential improvement directions. Our results provide empirical insights into the limitations of slot attention in natural image segmentation and lay the foundation for future enhancements of object-centric architectures in practical applications.
URI: https://hdl.handle.net/10356/184371
Schools: School of Electrical and Electronic Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Theses

Files in This Item:
File Description SizeFormat 
MSC_dissertation_WANG XUAN.pdf
  Restricted Access
8.84 MBAdobe PDFView/Open

Page view(s)

25
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.