Sketching for Large-Scale Learning of Mixture Models

Nicolas Keriven; Anthony Bourrier; Rémi Gribonval; Patrick Pérez

Communication Dans Un Congrès Année : 2016

Sketching for Large-Scale Learning of Mixture Models

(1) , (1, 2) , (1) , (2)

1
2

Nicolas Keriven

Fonction : Auteur
PersonId : 4423
IdHAL : nicolas-keriven
ORCID : 0000-0002-3846-8763
IdRef : 223562270

Parcimonie et Nouveaux Algorithmes pour le Signal et la Modélisation Audio

Anthony Bourrier

Fonction : Auteur
PersonId : 140
IdHAL : anthony-bourrier

Parcimonie et Nouveaux Algorithmes pour le Signal et la Modélisation Audio

Technicolor [Cesson Sévigné]

Rémi Gribonval

Fonction : Auteur
PersonId : 1255
IdHAL : remi-gribonval
ORCID : 0000-0002-9450-8125
IdRef : 113181590

Parcimonie et Nouveaux Algorithmes pour le Signal et la Modélisation Audio

Patrick Pérez

Fonction : Auteur
PersonId : 1022281

Technicolor [Cesson Sévigné]

Résumé

Learning parameters from voluminous data can be prohibitive in terms of memory and computational requirements. We propose a "compressive learning'' framework where we first sketch the data by computing random generalized moments of the underlying probability distribution, then estimate mixture model parameters from the sketch using an iterative algorithm analogous to greedy sparse signal recovery. We exemplify our framework with the sketched estimation of Gaussian Mixture Models (GMMs). We experimentally show that our approach yields results comparable to the classical Expectation-Maximization (EM) technique while requiring significantly less memory and fewer computations when the number of database elements is large. We report large-scale experiments in speaker verification, where our approach makes it possible to fully exploit a corpus of 1000 hours of speech signal to learn a universal background model at scales computationally inaccessible to EM.

Mots clés

database sketch compressed learning Gaussian mixture Compressed Sensing

Domaines

Machine Learning [stat.ML] Traitement du signal et de l'image [eess.SP] Probabilités [math.PR]

Fichier principal

paper.pdf (324.02 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Nicolas Keriven : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01208027

Soumis le : mardi 1 mars 2016-11:49:44

Dernière modification le : jeudi 18 avril 2024-16:57:32

Archivage à long terme le : mardi 31 mai 2016-11:19:36

Dates et versions

hal-01208027 , version 1 (01-10-2015)

hal-01208027 , version 2 (23-10-2015)

hal-01208027 , version 3 (01-03-2016)

Identifiants

HAL Id : hal-01208027 , version 3

Citer

Nicolas Keriven, Anthony Bourrier, Rémi Gribonval, Patrick Pérez. Sketching for Large-Scale Learning of Mixture Models. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), Mar 2016, Shanghai, China. ⟨hal-01208027v3⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA CENTRALESUPELEC IRISA-D5 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

952 Consultations

896 Téléchargements

Sketching for Large-Scale Learning of Mixture Models

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Relations

Exporter

Collections

Partager