Please use this identifier to cite or link to this item:
Title: Stochastic gradient descent based fuzzy clustering for large data
Authors: Chen, Lihui
Wang, Yangtao
Mei, Jian-Ping
Keywords: DRNTU::Engineering::Electrical and electronic engineering::Electronic systems
Issue Date: 2014
Source: Wang, Y., Chen, L., & Mei, J.-P. (2014). Stochastic gradient descent based fuzzy clustering for large data. 2014 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), 2511-2518.
Abstract: Data is growing at an unprecedented rate in commercial and scientific areas. Clustering algorithms for large data which require small memory consumption and scalability become increasingly important under this circumstance. In this paper, we propose a new clustering approach called stochastic gradient based fuzzy clustering(SGFC) which achieves the optimization based on stochastic approximation to handle such kind of large data. We derive an adaptive learning rate which can be updated incrementally and maintained automatically in gradient descent approach employed in SGFC. Moreover, SGFC is extended to a mini-batch SGFC to reduce the stochastic noise. Additionally, multi-pass SGFC is also proposed to improve the clustering performance. Experiments have been conducted on synthetic data to show the effectiveness of our derived adaptive learning rate. Experimental studies have been also conducted on several large benchmark datasets including real world image and document datasets. Compared with existing fuzzy clustering approaches for large data, the mini-batch SGFC shows comparable or better accuracy with significant less time consumption. These results demonstrate the great potential of SGFC for large data analysis.
DOI: 10.1109/FUZZ-IEEE.2014.6891755
Rights: © 2015 Institute of Electrical and Electronics Engineers (IEEE).
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:EEE Conference Papers

Files in This Item:
File Description SizeFormat 
Stochastic gradient descent based fuzzy clustering for large data.pdf
2.81 MBAdobe PDFThumbnail

Google ScholarTM




Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.