A variational EM algorithm for large-scale mixture modeling - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2003

A variational EM algorithm for large-scale mixture modeling

Jakob Verbeek
Nikos Vlassis
  • Fonction : Auteur
  • PersonId : 853678

Résumé

Mixture densities constitute a rich family of models that can be used in several data mining and machine learning applications, for instance, clustering. Although practical algorithms exist for learning such models from data, these algorithms typically do not scale very well with large datasets. Our approach, which builds on previous work by other authors, offers an acceleration of the EM algorithm for Gaussian mixtures by precomputing and storing sufficient statistics of the data in the nodes of a kd-tree. Contrary to other works, we obtain algorithms that strictly increase a lower bound on the data log-likelihood in every learning step. Experimental results illustrate the validity of our approach.
Fichier principal
Vignette du fichier
verbeek03asci2.pdf (84.31 Ko) Télécharger le fichier
Vignette du fichier
VVN03.png (26.2 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Format : Figure, Image
Loading...

Dates et versions

inria-00321486 , version 1 (02-02-2011)
inria-00321486 , version 2 (08-03-2011)

Identifiants

  • HAL Id : inria-00321486 , version 2

Citer

Jakob Verbeek, Nikos Vlassis, Jan Nunnink. A variational EM algorithm for large-scale mixture modeling. 9th Annual Conference of the Advanced School for Computing and Imaging (ASCI '03), Jun 2003, Heijen, Netherlands. pp.136--143. ⟨inria-00321486v2⟩
364 Consultations
565 Téléchargements

Partager

Gmail Facebook X LinkedIn More