HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation

An a contrario approach to hierarchical clustering validity assessment

Abstract : In this paper we present a method to detect natural groups in a data set, based on hierarchical clustering. A measure of the meaningfulness of clusters, derived from a background model assuming no class structure in the data, provides a way to compare clusters, and leads to a cluster validity criterion. This criterion is applied to every cluster in the nested structure. While all clusters passing the validity test are meaningful in themselves, the set of all of them will probably provide a redundant data representation. By selecting a subset of the meaningful clusters, a good data representation, which also discards outliers, can be achieved. The strategy we propose combines a new merging criterion (also derived from the background model) with a selection of local maxima of the meaningfulness with respect to inclusion, in the nested hierarchical structure.
Document type :
Complete list of metadata

Cited literature [14 references]  Display  Hide  Download

Contributor : Rapport de Recherche Inria Connect in order to contact the contributor
Submitted on : Friday, May 19, 2006 - 9:12:05 PM
Last modification on : Friday, February 4, 2022 - 3:25:29 AM
Long-term archiving on: : Sunday, April 4, 2010 - 9:42:46 PM


  • HAL Id : inria-00070682, version 1


Frédéric Cao, Julie Delon, Agnès Desolneux, Pablo Musé, Frédéric Sur. An a contrario approach to hierarchical clustering validity assessment. [Research Report] RR-5318, INRIA. 2004, pp.15. ⟨inria-00070682⟩



Record views


Files downloads