NMF with time-frequency activations to model non-stationary audio events

Abstract : Real world sounds often exhibit time-varying spectral shapes, as observed in the spectrogram of a harpsichord tone or that of a transition between two pronounced vowels. Whereas the standard Non-negative Matrix Factorization (NMF) assumes fixed spectral atoms, an extension is proposed where the temporal activations (coefficients of the decomposition on the spectral atom basis) become frequency dependent and follow a timevarying ARMA modeling. This extension can thus be interpreted with the help of a source/filter paradigm and is referred to as source/filter factorization. This factorization leads to an efficient single-atom decomposition for a single audio event with strong spectral variation (but with constant pitch). The new algorithm is tested on real audio data and shows promising results.
Liste complète des métadonnées

Cited literature [27 references]  Display  Hide  Download

https://hal.inria.fr/hal-00945201
Contributor : Roland Badeau <>
Submitted on : Tuesday, March 25, 2014 - 9:17:55 AM
Last modification on : Wednesday, February 20, 2019 - 2:40:25 PM
Document(s) archivé(s) le : Wednesday, June 25, 2014 - 10:42:17 AM

File

Hennequin-TASLP2011.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00945201, version 1

Collections

Citation

Romain Hennequin, Roland Badeau, Bertrand David. NMF with time-frequency activations to model non-stationary audio events. IEEE_J_ASLP, IEEE, 2011, 19 (4), pp.744--753. ⟨hal-00945201⟩

Share

Metrics

Record views

409

Files downloads

486