p-Spectral Clustering Based on Neighborhood Attribute Granulation

Abstract : Clustering analysis is an important method for data mining and information statistics. Data clustering is to find the intrinsic links between objects and describe the internal structures of data sets. p-Spectral clustering is based on Cheeger cut criterion. It has good performance on many challenging data sets. But the original p-spectral clustering algorithm is not suitable for high-dimensional data. To solve this problem, this paper improves p-spectral clustering using neighborhood attribute granulation and proposes NAG-pSC algorithm. Neighborhood rough sets can directly process the continuous data. We introduce information entropy into the neighborhood rough sets to weaken the negative impact of noise data and redundant attributes on clustering. In this way, the data points within the same cluster are more compact, while the data points between different clusters are more separate. The effectiveness of the proposed NAG-pSC algorithm is tested on several benchmark data sets. Experiments show that the neighborhood attribute granulation will highlight the differences between data points while maintaining their characteristics in the clustering. With the help of neighborhood attribute granulation, NAG-pSC is able to recognize more complex data structures and has strong robustness to the noise or irrelevant features in high-dimensional data.
Document type :
Conference papers
Complete list of metadatas

Cited literature [13 references]  Display  Hide  Download

https://hal.inria.fr/hal-01615003
Contributor : Hal Ifip <>
Submitted on : Wednesday, October 11, 2017 - 4:58:23 PM
Last modification on : Friday, November 3, 2017 - 10:24:06 PM
Long-term archiving on : Friday, January 12, 2018 - 3:44:31 PM

File

433802_1_En_6_Chapter.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Shifei Ding, Hongjie Jia, Mingjing Du, Qiankun Hu. p-Spectral Clustering Based on Neighborhood Attribute Granulation. 9th International Conference on Intelligent Information Processing (IIP), Nov 2016, Melbourne, VIC, Australia. pp.50-58, ⟨10.1007/978-3-319-48390-0_6⟩. ⟨hal-01615003⟩

Share

Metrics

Record views

481

Files downloads

78