Fast Gaussian Pairwise Constrained Spectral Clustering - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2014

Fast Gaussian Pairwise Constrained Spectral Clustering

Résumé

We consider the problem of spectral clustering with partial supervision in the form of must-link and cannot-link constraints. Such pairwise constraints are common in problems like coreference resolution in natural language processing. The approach developed in this paper is to learn a new representation space for the data together with a dis-tance in this new space. The representation space is obtained through a constraint-driven linear transformation of a spectral embedding of the data. Constraints are expressed with a Gaussian function that locally reweights the similarities in the projected space. A global, non-convex optimization objective is then derived and the model is learned via gradi-ent descent techniques. Our algorithm is evaluated on standard datasets and compared with state of the art algorithms, like [14,18,31]. Results on these datasets, as well on the CoNLL-2012 coreference resolution shared task dataset, show that our algorithm significantly outperforms related approaches and is also much more scalable.
Fichier principal
Vignette du fichier
fgpwc.pdf (5.29 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-01017269 , version 1 (18-07-2014)

Identifiants

Citer

David Chatel, Pascal Denis, Marc Tommasi. Fast Gaussian Pairwise Constrained Spectral Clustering. ECML/PKDD - 7th European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, Sep 2014, Nancy, France. pp.242 - 257, ⟨10.1007/978-3-662-44848-9_16⟩. ⟨hal-01017269⟩
271 Consultations
302 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More