Clustering Nominal and Numerical Data: A New Distance Concept for a Hybrid Genetic Algorithm - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2004

Clustering Nominal and Numerical Data: A New Distance Concept for a Hybrid Genetic Algorithm

Résumé

As intrinsic structures, like the number of clusters, is, for real data, a major issue of the clustering problem, we propose, in this paper, CHyGA (Clustering Hybrid Genetic Algorithm) an hybrid genetic algorithm for clustering. CHyGA treats the clustering problem as an optimization problem and searches for an optimal number of clusters characterized by an optimal distribution of instances into the clusters. CHyGA introduces a new representation of solutions and uses dedicated operators, such as one iteration of K-means as a mutation operator. In order to deal with nominal data, we propose a new definition of the cluster center concept and demonstrate its properties. Experimental results on classical benchmarks are given.

Mots clés

Fichier principal
Vignette du fichier
jourdan_evocop04.pdf (204.67 Ko) Télécharger le fichier

Dates et versions

inria-00001183 , version 1 (30-03-2006)

Identifiants

  • HAL Id : inria-00001183 , version 1

Citer

Laetitia Jourdan, Clarisse Dhaenens, El-Ghazali Talbi. Clustering Nominal and Numerical Data: A New Distance Concept for a Hybrid Genetic Algorithm. Evolutionary Computation in Combinatorial Optimization -- {EvoCOP}~2004, Apr 2004, Coimbra, Portugal, pp.220--229. ⟨inria-00001183⟩
155 Consultations
897 Téléchargements

Partager

Gmail Facebook X LinkedIn More