Data Streaming with Affinity Propagation

Xiangliang Zhang 1, Cyril Furtlehner 1, Michèle Sebag 1
1 TAO - Machine Learning and Optimisation
CNRS - Centre National de la Recherche Scientifique : UMR8623, Inria Saclay - Ile de France, UP11 - Université Paris-Sud - Paris 11, LRI - Laboratoire de Recherche en Informatique
Abstract : This paper proposes StrAP (Streaming AP), which extends Affinity Propagation (AP) to data streaming. AP, a new clustering algorithm, uses message passing to extract the data items, or exemplars, that best represent the dataset. StrAP is built in several steps. The first (Weighted AP) extends AP to weighted items with no loss of generality. The second (Hierarchical WAP) reduces the quadratic complexity of AP by applying AP to data subsets and then applying Weighted AP to the exemplars extracted from all subsets. Finally, StrAP extends Hierarchical WAP to handle changes in the data distribution. Experiments on artificial datasets, on the Intrusion Detection benchmark (KDD99), and on a real-world problem (clustering the stream of jobs submitted to the EGEE grid system) provide a comparative validation of the approach.
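The two-level scheme described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`affinity_propagation`, `cluster`, `hierarchical_wap`), the 1-D points, the negative squared distance as similarity, the median preference, and the weighting rule (the similarity from item i is multiplied by its weight n_i) are all assumptions chosen for illustration, and the input is assumed to be non-empty. The streaming step that handles distribution changes is not covered.

```python
import statistics


def affinity_propagation(S, iters=200, damping=0.5):
    """Plain AP on a similarity matrix S (list of lists); S[k][k] is the preference.

    Returns (exemplar indices, exemplar index chosen by each item).
    """
    n = len(S)
    R = [[0.0] * n for _ in range(n)]  # responsibilities r(i, k)
    A = [[0.0] * n for _ in range(n)]  # availabilities a(i, k)
    for _ in range(iters):
        # r(i,k) = s(i,k) - max over k' != k of (a(i,k') + s(i,k'))
        for i in range(n):
            scores = [A[i][k] + S[i][k] for k in range(n)]
            for k in range(n):
                rival = max((scores[j] for j in range(n) if j != k), default=0.0)
                R[i][k] = damping * R[i][k] + (1 - damping) * (S[i][k] - rival)
        # a(i,k) = min(0, r(k,k) + sum of positive r(i',k) for i' not in {i,k})
        # a(k,k) = sum of positive r(i',k) for i' != k
        for k in range(n):
            pos = [max(0.0, R[i][k]) for i in range(n)]
            total = sum(pos) - pos[k]
            for i in range(n):
                new = total if i == k else min(0.0, R[k][k] + total - pos[i])
                A[i][k] = damping * A[i][k] + (1 - damping) * new
    evidence = [R[k][k] + A[k][k] for k in range(n)]
    exemplars = [k for k in range(n) if evidence[k] > 0]
    if not exemplars:  # fallback (assumption): keep the single best candidate
        exemplars = [max(range(n), key=evidence.__getitem__)]
    labels = []
    for i in range(n):
        labels.append(i if i in exemplars else max(exemplars, key=lambda e: S[i][e]))
    return exemplars, labels


def cluster(points, weights):
    """Weighted AP on 1-D points; returns exemplar values and their summed weights."""
    n = len(points)
    # Assumed weighting: an item standing for n_i originals pays n_i times the cost.
    S = [[-weights[i] * (points[i] - points[j]) ** 2 for j in range(n)] for i in range(n)]
    off_diag = [S[i][j] for i in range(n) for j in range(n) if i != j]
    preference = statistics.median(off_diag) if off_diag else 0.0
    for k in range(n):
        S[k][k] = preference
    exemplars, labels = affinity_propagation(S)
    new_weights = [sum(weights[i] for i in range(n) if labels[i] == e) for e in exemplars]
    return [points[e] for e in exemplars], new_weights


def hierarchical_wap(points, chunk_size):
    """Run AP on each chunk, then weighted AP on the pooled exemplars."""
    ex_points, ex_weights = [], []
    for start in range(0, len(points), chunk_size):
        chunk = points[start:start + chunk_size]
        p, w = cluster(chunk, [1] * len(chunk))
        ex_points += p
        ex_weights += w
    return cluster(ex_points, ex_weights)
```

Calling `hierarchical_wap(data, chunk_size=5)` on a list of floats returns the final exemplars together with the number of original items each one represents. Each chunk costs time quadratic in `chunk_size` rather than in the full dataset size, which is the motivation the abstract gives for the hierarchical step.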
Document type :
Conference papers

Cited literature [21 references]

https://hal.inria.fr/inria-00289679
Contributor : Xiangliang Zhang
Submitted on : Wednesday, June 25, 2008 - 11:53:15 AM
Last modification on : Thursday, April 5, 2018 - 12:30:12 PM
Long-term archiving on : Friday, November 25, 2016 - 10:18:22 PM

File

ECML08_final3.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00289679, version 3

Citation

Xiangliang Zhang, Cyril Furtlehner, Michèle Sebag. Data Streaming with Affinity Propagation. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, Sep 2008, Antwerp, Belgium. ⟨inria-00289679v3⟩
