Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/173324
Title: SmooSeg: smoothness prior for unsupervised semantic segmentation
Authors: Lan, Mengcheng
Wang, Xinjiang
Ke, Yiping
Xu, Jiaxing
Feng, Litong
Zhang, Wayne
Keywords: Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Issue Date: 2023
Source: Lan, M., Wang, X., Ke, Y., Xu, J., Feng, L. & Zhang, W. (2023). SmooSeg: smoothness prior for unsupervised semantic segmentation. 37th Conference on Neural Information Processing Systems (NeurIPS 2023), 1-21.
Project: IAF-ICP 
MOE-T2EP20220- 0006 
Conference: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)
Abstract: Unsupervised semantic segmentation is a challenging task that segments images into semantic groups without manual annotation. Prior works have primarily focused on leveraging prior knowledge of semantic consistency or priori concepts from self-supervised learning methods, which often overlook the coherence property of image segments. In this paper, we demonstrate that the smoothness prior, asserting that close features in a metric space share the same semantics, can significantly simplify segmentation by casting unsupervised semantic segmentation as an energy minimization problem. Under this paradigm, we propose a novel approach called SmooSeg that harnesses self-supervised learning methods to model the closeness relationships among observations as smoothness signals. To effectively discover coherent semantic segments, we introduce a novel smoothness loss that promotes piecewise smoothness within segments while preserving discontinuities across different segments. Additionally, to further enhance segmentation quality, we design an asymmetric teacher-student style predictor that generates smoothly updated pseudo labels, facilitating an optimal fit between observations and labeling outputs. Thanks to the rich supervision cues of the smoothness prior, our SmooSeg significantly outperforms STEGO in terms of pixel accuracy on three datasets: COCOStuff (+14.9%), Cityscapes (+13.0%), and Potsdam-3 (+5.7%).
URI: https://hdl.handle.net/10356/173324
URL: https://neurips.cc/
Schools: School of Computer Science and Engineering 
Organisations: SenseTime Research 
Research Centres: S-Lab
Rights: © 2023 The Author(s). Published by Neural Information Processing Systems. All rights reserved. This article may be downloaded for personal use only. Any other use requires prior permission of the copyright holder.
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Conference Papers

Files in This Item:
File Description SizeFormat 
NeurIPS2023_SmooSeg Smoothness Prior for Unsupervised Semantic Segmentation.pdf7.75 MBAdobe PDFThumbnail
View/Open

Page view(s)

89
Updated on Sep 15, 2024

Download(s) 50

40
Updated on Sep 15, 2024

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.