Scaling Analysis of the Affinity Propagation Algorithm - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Rapport (Rapport De Recherche) Année : 2009

Scaling Analysis of the Affinity Propagation Algorithm

Résumé

We analyze and exploit some scaling properties of the Affinity Propagation (AP) clustering algorithm proposed by Frey and Dueck (2007). First we observe that a divide and conquer strategy, used on a large data set hierarchically reduces the complexity O(N^2) to O(N^{(h+2)/(h+1)}), for a data-set of size N and a depth h of the hierarchical strategy. For a data-set embedded in a d-dimensional space, we show that this is obtained without notably damaging the precision except in dimension d=2. In fact, for d larger than 2 the relative loss in precision scales like N^{(2-d)/(h+1)d}. Finally, under some conditions we observe that there is a value s* of the penalty coefficient, a free parameter used to fix the number of clusters, which separate a fragmentation phase (for ss*) of the underlying hidden cluster structure. At this precise point holds a self-similarity property which can be exploited by the hierarchical strategy to actually locate its position. From this observation, a strategy based on AP can be defined to find out how many cluster are present in a given dataset.
Fichier principal
Vignette du fichier
RR-7046.pdf (626.32 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

inria-00420407 , version 1 (28-09-2009)
inria-00420407 , version 2 (09-06-2010)

Identifiants

  • HAL Id : inria-00420407 , version 1

Citer

Cyril Furtlehner, Michèle Sebag, Zhang Xiangliang. Scaling Analysis of the Affinity Propagation Algorithm. [Research Report] Phys. Rev. E 81, 066102 (2010), 2009, pp.31. ⟨inria-00420407v1⟩

Collections

INRIA-RRRT
268 Consultations
297 Téléchargements

Partager

Gmail Facebook X LinkedIn More