Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/173324
Title: | SmooSeg: smoothness prior for unsupervised semantic segmentation | Authors: | Lan, Mengcheng Wang, Xinjiang Ke, Yiping Xu, Jiaxing Feng, Litong Zhang, Wayne |
Keywords: | Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision |
Issue Date: | 2023 | Source: | Lan, M., Wang, X., Ke, Y., Xu, J., Feng, L. & Zhang, W. (2023). SmooSeg: smoothness prior for unsupervised semantic segmentation. 37th Conference on Neural Information Processing Systems (NeurIPS 2023), 1-21. | Project: | IAF-ICP MOE-T2EP20220- 0006 |
Conference: | 37th Conference on Neural Information Processing Systems (NeurIPS 2023) | Abstract: | Unsupervised semantic segmentation is a challenging task that segments images into semantic groups without manual annotation. Prior works have primarily focused on leveraging prior knowledge of semantic consistency or priori concepts from self-supervised learning methods, which often overlook the coherence property of image segments. In this paper, we demonstrate that the smoothness prior, asserting that close features in a metric space share the same semantics, can significantly simplify segmentation by casting unsupervised semantic segmentation as an energy minimization problem. Under this paradigm, we propose a novel approach called SmooSeg that harnesses self-supervised learning methods to model the closeness relationships among observations as smoothness signals. To effectively discover coherent semantic segments, we introduce a novel smoothness loss that promotes piecewise smoothness within segments while preserving discontinuities across different segments. Additionally, to further enhance segmentation quality, we design an asymmetric teacher-student style predictor that generates smoothly updated pseudo labels, facilitating an optimal fit between observations and labeling outputs. Thanks to the rich supervision cues of the smoothness prior, our SmooSeg significantly outperforms STEGO in terms of pixel accuracy on three datasets: COCOStuff (+14.9%), Cityscapes (+13.0%), and Potsdam-3 (+5.7%). | URI: | https://hdl.handle.net/10356/173324 | URL: | https://neurips.cc/ | Schools: | School of Computer Science and Engineering | Organisations: | SenseTime Research | Research Centres: | S-Lab | Rights: | © 2023 The Author(s). Published by Neural Information Processing Systems. All rights reserved. This article may be downloaded for personal use only. Any other use requires prior permission of the copyright holder. | Fulltext Permission: | open | Fulltext Availability: | With Fulltext |
Appears in Collections: | SCSE Conference Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
NeurIPS2023_SmooSeg Smoothness Prior for Unsupervised Semantic Segmentation.pdf | 7.75 MB | Adobe PDF | View/Open |
Page view(s)
89
Updated on Sep 15, 2024
Download(s) 50
40
Updated on Sep 15, 2024
Google ScholarTM
Check
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.