NMF with time-frequency activations to model non-stationary audio events

Abstract : Real world sounds often exhibit time-varying spectral shapes, as observed in the spectrogram of a harpsichord tone or that of a transition between two pronounced vowels. Whereas the standard Non-negative Matrix Factorization (NMF) assumes fixed spectral atoms, an extension is proposed where the temporal activations (coefficients of the decomposition on the spectral atom basis) become frequency dependent and follow a timevarying ARMA modeling. This extension can thus be interpreted with the help of a source/filter paradigm and is referred to as source/filter factorization. This factorization leads to an efficient single-atom decomposition for a single audio event with strong spectral variation (but with constant pitch). The new algorithm is tested on real audio data and shows promising results.
Type de document :
Article dans une revue
IEEE_J_ASLP, IEEE, 2011, 19 (4), pp.744--753
Liste complète des métadonnées

Littérature citée [27 références]  Voir  Masquer  Télécharger

Contributeur : Roland Badeau <>
Soumis le : mardi 25 mars 2014 - 09:17:55
Dernière modification le : jeudi 11 janvier 2018 - 06:23:39
Document(s) archivé(s) le : mercredi 25 juin 2014 - 10:42:17


Fichiers produits par l'(les) auteur(s)


  • HAL Id : hal-00945201, version 1


Romain Hennequin, Roland Badeau, Bertrand David. NMF with time-frequency activations to model non-stationary audio events. IEEE_J_ASLP, IEEE, 2011, 19 (4), pp.744--753. 〈hal-00945201〉



Consultations de la notice


Téléchargements de fichiers