Skip to Main content Skip to Navigation
Journal articles

NMF with time-frequency activations to model non-stationary audio events

Abstract : Real world sounds often exhibit time-varying spectral shapes, as observed in the spectrogram of a harpsichord tone or that of a transition between two pronounced vowels. Whereas the standard Non-negative Matrix Factorization (NMF) assumes fixed spectral atoms, an extension is proposed where the temporal activations (coefficients of the decomposition on the spectral atom basis) become frequency dependent and follow a timevarying ARMA modeling. This extension can thus be interpreted with the help of a source/filter paradigm and is referred to as source/filter factorization. This factorization leads to an efficient single-atom decomposition for a single audio event with strong spectral variation (but with constant pitch). The new algorithm is tested on real audio data and shows promising results.
Complete list of metadata

Cited literature [27 references]  Display  Hide  Download
Contributor : Roland Badeau Connect in order to contact the contributor
Submitted on : Tuesday, March 25, 2014 - 9:17:55 AM
Last modification on : Wednesday, October 14, 2020 - 1:11:49 PM
Long-term archiving on: : Wednesday, June 25, 2014 - 10:42:17 AM


Files produced by the author(s)


  • HAL Id : hal-00945201, version 1



Romain Hennequin, Roland Badeau, Bertrand David. NMF with time-frequency activations to model non-stationary audio events. IEEE_J_ASLP, IEEE, 2011, 19 (4), pp.744--753. ⟨hal-00945201⟩



Record views


Files downloads