Skip to Main content Skip to Navigation
Reports

An a contrario approach to hierarchical clustering validity assessment

Abstract : In this paper we present a method to detect natural groups in a data set, based on hierarchical clustering. A measure of the meaningfulness of clusters, derived from a background model assuming no class structure in the data, provides a way to compare clusters, and leads to a cluster validity criterion. This criterion is applied to every cluster in the nested structure. While all clusters passing the validity test are meaningful in themselves, the set of all of them will probably provide a redundant data representation. By selecting a subset of the meaningful clusters, a good data representation, which also discards outliers, can be achieved. The strategy we propose combines a new merging criterion (also derived from the background model) with a selection of local maxima of the meaningfulness with respect to inclusion, in the nested hierarchical structure.
Document type :
Reports
Complete list of metadata

Cited literature [14 references]  Display  Hide  Download

https://hal.inria.fr/inria-00070682
Contributor : Rapport de Recherche Inria <>
Submitted on : Friday, May 19, 2006 - 9:12:05 PM
Last modification on : Monday, February 15, 2021 - 10:50:25 AM
Long-term archiving on: : Sunday, April 4, 2010 - 9:42:46 PM

Identifiers

  • HAL Id : inria-00070682, version 1

Citation

Frédéric Cao, Julie Delon, Agnès Desolneux, Pablo Musé, Frédéric Sur. An a contrario approach to hierarchical clustering validity assessment. [Research Report] RR-5318, INRIA. 2004, pp.15. ⟨inria-00070682⟩

Share

Metrics

Record views

375

Files downloads

464