A variational EM algorithm for large-scale mixture modeling
Abstract
Mixture densities constitute a rich family of models with many data mining and machine learning applications, for instance clustering. Although practical algorithms exist for learning such models from data, they typically scale poorly to large datasets. Building on earlier work, our approach accelerates the EM algorithm for Gaussian mixtures by precomputing and storing sufficient statistics of the data in the nodes of a kd-tree. Unlike previous approaches, we obtain algorithms that strictly increase a lower bound on the data log-likelihood in every learning step. Experimental results illustrate the validity of our approach.
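The abstract's core idea can be sketched in code. The following is a minimal illustration, not the authors' implementation: each kd-tree node caches the sufficient statistics of the points beneath it (count, sum, and sum of outer products), from which a mean and covariance can be read off exactly, so an EM step can work with node summaries instead of individual points. The class and parameter names (`Node`, `leaf_size`) are illustrative choices, not from the paper.

```python
import numpy as np

class Node:
    """kd-tree node caching sufficient statistics of its subtree.

    Illustrative sketch: stores n (count), s (sum of points), and
    S (sum of outer products), enough to recover mean and covariance.
    """
    def __init__(self, points, depth=0, leaf_size=32):
        self.n = len(points)                 # count
        self.s = points.sum(axis=0)          # sum of points
        self.S = points.T @ points           # sum of outer products
        self.left = self.right = None
        if self.n > leaf_size:
            axis = depth % points.shape[1]   # cycle through dimensions
            order = np.argsort(points[:, axis])
            mid = self.n // 2
            self.left = Node(points[order[:mid]], depth + 1, leaf_size)
            self.right = Node(points[order[mid:]], depth + 1, leaf_size)

    def mean(self):
        return self.s / self.n

    def cov(self):
        m = self.mean()
        # Biased (maximum-likelihood) covariance from the cached statistics.
        return self.S / self.n - np.outer(m, m)

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 2))
root = Node(X)
# The cached statistics reproduce the exact statistics of the full data.
assert np.allclose(root.mean(), X.mean(axis=0))
assert np.allclose(root.cov(), np.cov(X.T, bias=True))
```

In an accelerated EM step, a whole subtree can then be treated as a single weighted "data point" whenever the mixture responsibilities are nearly constant over it, which is where the speedup over per-point EM comes from.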