Skip to Main content Skip to Navigation
Conference papers

Clustering Nominal and Numerical Data: A New Distance Concept for a Hybrid Genetic Algorithm

Laetitia Jourdan 1 Clarisse Dhaenens 1 El-Ghazali Talbi 1
1 DOLPHIN - Parallel Cooperative Multi-criteria Optimization
LIFL - Laboratoire d'Informatique Fondamentale de Lille, Inria Lille - Nord Europe
Abstract : As intrinsic structures, like the number of clusters, is, for real data, a major issue of the clustering problem, we propose, in this paper, CHyGA (Clustering Hybrid Genetic Algorithm) an hybrid genetic algorithm for clustering. CHyGA treats the clustering problem as an optimization problem and searches for an optimal number of clusters characterized by an optimal distribution of instances into the clusters. CHyGA introduces a new representation of solutions and uses dedicated operators, such as one iteration of K-means as a mutation operator. In order to deal with nominal data, we propose a new definition of the cluster center concept and demonstrate its properties. Experimental results on classical benchmarks are given.
Document type :
Conference papers
Complete list of metadatas

https://hal.inria.fr/inria-00001183
Contributor : Laetitia Jourdan <>
Submitted on : Thursday, March 30, 2006 - 1:35:56 PM
Last modification on : Thursday, May 28, 2020 - 9:22:09 AM
Document(s) archivé(s) le : Saturday, April 3, 2010 - 10:09:23 PM

Identifiers

  • HAL Id : inria-00001183, version 1

Citation

Laetitia Jourdan, Clarisse Dhaenens, El-Ghazali Talbi. Clustering Nominal and Numerical Data: A New Distance Concept for a Hybrid Genetic Algorithm. Evolutionary Computation in Combinatorial Optimization -- ~2004, Apr 2004, Coimbra, Portugal, pp.220--229. ⟨inria-00001183⟩

Share

Metrics

Record views

405

Files downloads

1326