Optimization of Initial Centroids for K-Means Algorithm Based on Small World Network

Abstract : K-means algorithm is a relatively simple and fast gather clustering algorithm. However, the initial clustering center of the traditional k-means algorithm was generated randomly from the dataset, and the clustering result was unstable. In this paper, we propose a novel method to optimize the selection of initial centroids for k-means algorithm based on the small world network. This paper firstly models a text document set as a network which has small world phenomenon and then use small-world’s characteristics to form k initial centroids. Experimental evaluation on documents croups show clustering results (total cohesion, purity, recall) obtained by proposed method comparable with traditional k-means algorithm. The experiments show that results are obtained by the proposed algorithm can be relatively stability and efficiency. Therefore, this method can be considered as an effective application in the domain of text documents, especially in using text clustering for topic detection.
Type de document :
Communication dans un congrès
Zhongzhi Shi; David Leake; Sunil Vadera. 7th International Conference on Intelligent Information Processing (IIP), Oct 2012, Guilin, China. Springer, IFIP Advances in Information and Communication Technology, AICT-385, pp.87-96, 2012, Intelligent Information Processing VI. 〈10.1007/978-3-642-32891-6_13〉
Liste complète des métadonnées

Littérature citée [10 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01524955
Contributeur : Hal Ifip <>
Soumis le : vendredi 19 mai 2017 - 10:43:16
Dernière modification le : vendredi 19 mai 2017 - 10:45:57
Document(s) archivé(s) le : lundi 21 août 2017 - 00:30:27

Fichier

978-3-642-32891-6_13_Chapter.p...
Fichiers produits par l'(les) auteur(s)

Licence


Distributed under a Creative Commons Paternité 4.0 International License

Identifiants

Citation

Shimo Shen, Zuqiang Meng. Optimization of Initial Centroids for K-Means Algorithm Based on Small World Network. Zhongzhi Shi; David Leake; Sunil Vadera. 7th International Conference on Intelligent Information Processing (IIP), Oct 2012, Guilin, China. Springer, IFIP Advances in Information and Communication Technology, AICT-385, pp.87-96, 2012, Intelligent Information Processing VI. 〈10.1007/978-3-642-32891-6_13〉. 〈hal-01524955〉

Partager

Métriques

Consultations de la notice

91

Téléchargements de fichiers

26