Supervised Group Nonnegative Matrix Factorisation With Similarity Constraints And Applications To Speaker Identification

Romain Serizel; Victor Bisot; Slim Essid; Gael Richard

Conference paper, Year: 2017

Romain Serizel

Role: Author
PersonId : 10320
IdHAL : romain-serizel
IdRef : 223797391

Speech Modeling for Facilitating Oral-Based Communication

Victor Bisot

Role: Author

Laboratoire Traitement et Communication de l'Information

Slim Essid

Role: Author
PersonId : 181234
IdHAL : slimessid
ORCID : 0000-0002-0028-327X
IdRef : 11025130X

Laboratoire Traitement et Communication de l'Information

Gael Richard

Role: Author
PersonId : 14146
IdHAL : gael-richard
IdRef : 094977208

Laboratoire Traitement et Communication de l'Information

Abstract

This paper presents supervised feature learning approaches for speaker identification that rely on nonnegative matrix factorisation. Recent studies have shown that group nonnegative matrix factorisation and task-driven supervised dictionary learning can help perform effective feature learning for audio classification problems. This paper proposes integrating a recent method that relies on group nonnegative matrix factorisation into a task-driven supervised framework for speaker identification. The goal is to capture both speaker variability and session variability while exploiting the discriminative learning aspect of the task-driven approach. Results on a subset of the ESTER corpus show that the proposed approach can be competitive with I-vectors.

Index Terms: Nonnegative matrix factorisation, feature learning, dictionary learning, online learning, speaker identification
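As background for readers unfamiliar with the building block named in the abstract, the following is a minimal sketch of plain, unsupervised nonnegative matrix factorisation using the standard multiplicative updates for the Euclidean cost. It is not the supervised group method proposed in the paper: the function name `nmf`, the random test matrix, the rank and the iteration count are all illustrative assumptions.

```python
import numpy as np

def nmf(V, rank, n_iter=200, eps=1e-9, seed=0):
    """Approximate a nonnegative matrix V (features x frames) as W @ H.

    Hypothetical helper for illustration only: plain NMF with
    multiplicative updates, Euclidean cost, random initialisation.
    """
    rng = np.random.default_rng(seed)
    n_features, n_frames = V.shape
    W = rng.random((n_features, rank)) + eps   # dictionary (basis vectors)
    H = rng.random((rank, n_frames)) + eps     # activations (learned features)
    for _ in range(n_iter):
        # Each update multiplies by a nonnegative ratio, so W and H stay >= 0.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

# Toy data standing in for a nonnegative time-frequency representation.
rng = np.random.default_rng(1)
V = rng.random((20, 50))
W, H = nmf(V, rank=5)
rel_err = np.linalg.norm(V - W @ H) / np.linalg.norm(V)
print(W.shape, H.shape)
```

In a feature-learning setting of the kind the abstract describes, the columns of `H` would play the role of the learned features passed to a classifier. The group and task-driven extensions add further structure and supervision on top of this basic factorisation.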

Keywords

Nonnegative matrix factorisation, feature learning, dictionary learning, online learning, speaker identification

Domains

Signal and Image Processing [eess.SP]

Main file

supervised-group-nonnegative.pdf (200.55 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-01484744

Submitted on: Tuesday, 7 March 2017, 16:25:06

Last modified on: Monday, 9 October 2023, 12:49:40

Long-term archiving on: Thursday, 8 June 2017, 14:22:38

Dates and versions

hal-01484744 , version 1 (07-03-2017)

Identifiers

HAL Id: hal-01484744, version 1

Cite

Romain Serizel, Victor Bisot, Slim Essid, Gael Richard. Supervised Group Nonnegative Matrix Factorisation With Similarity Constraints And Applications To Speaker Identification. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Mar 2017, New Orleans, United States. ⟨hal-01484744⟩

Collections

INSTITUT-TELECOM CNRS INRIA PARISTECH UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD LTCI IDS S2A

510 views

269 downloads
