Supervised Group Nonnegative Matrix Factorisation With Similarity Constraints And Applications To Speaker Identification

Romain Serizel 1 Victor Bisot 2 Slim Essid 2 Gaël Richard 2
1 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper presents supervised feature learning approaches for speaker identification that rely on nonnegative matrix factorisa-tion. Recent studies have shown that group nonnegative matrix factorisation and task-driven supervised dictionary learning can help performing effective feature learning for audio classification problems. This paper proposes to integrate a recent method that relies on group nonnegative matrix factorisation into a task-driven supervised framework for speaker identification. The goal is to capture both the speaker variability and the session variability while exploiting the discriminative learning aspect of the task-driven approach. Results on a subset of the ESTER corpus prove that the proposed approach can be competitive with I-vectors. Index Terms— Nonnegative matrix factorisation, feature learning , dictionary learning, online learning, speaker identification
Type de document :
Communication dans un congrès
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Mar 2017, New Orleans, United States. 2017, 〈http://www.ieee-icassp2017.org/〉
Liste complète des métadonnées

Littérature citée [27 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01484744
Contributeur : Romain Serizel <>
Soumis le : mardi 7 mars 2017 - 16:25:06
Dernière modification le : jeudi 11 janvier 2018 - 06:27:31
Document(s) archivé(s) le : jeudi 8 juin 2017 - 14:22:38

Fichier

supervised-group-nonnegative.p...
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01484744, version 1

Citation

Romain Serizel, Victor Bisot, Slim Essid, Gaël Richard. Supervised Group Nonnegative Matrix Factorisation With Similarity Constraints And Applications To Speaker Identification. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Mar 2017, New Orleans, United States. 2017, 〈http://www.ieee-icassp2017.org/〉. 〈hal-01484744〉

Partager

Métriques

Consultations de la notice

388

Téléchargements de fichiers

115