NMF with time-frequency activations to model non-stationary audio events

Romain Hennequin; Roland Badeau; Bertrand David

Journal article, IEEE_J_ASLP. Year: 2011

Romain Hennequin

Role: Author

Laboratoire Traitement et Communication de l'Information

Roland Badeau

Role: Author
PersonId : 1121
IdHAL : rbadeau
ORCID : 0000-0002-9630-6877
IdRef : 106938134

Laboratoire Traitement et Communication de l'Information

Bertrand David

Role: Author
PersonId : 179908
IdHAL : bedavid
ORCID : 0000-0003-1153-422X

Laboratoire Traitement et Communication de l'Information

Abstract

Real-world sounds often exhibit time-varying spectral shapes, as observed in the spectrogram of a harpsichord tone or of a transition between two pronounced vowels. Whereas standard Non-negative Matrix Factorization (NMF) assumes fixed spectral atoms, an extension is proposed in which the temporal activations (the coefficients of the decomposition on the spectral atom basis) become frequency dependent and follow a time-varying ARMA model. This extension can thus be interpreted through a source/filter paradigm and is referred to as source/filter factorization. The factorization leads to an efficient single-atom decomposition of a single audio event with strong spectral variation (but constant pitch). The new algorithm is tested on real audio data and shows promising results.
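To make the idea in the abstract concrete, here is a minimal NumPy sketch of how a spectrogram model with frequency-dependent activations could be evaluated. It is not the authors' implementation, and it only evaluates the model; it does not fit it. The abstract does not give the formula, so the parametrization below is an assumption: each activation is taken to be the power response of an ARMA filter, h(nu) = sigma2 * |B(nu)|^2 / |A(nu)|^2. The function names, array shapes, and the frequency grid on [0, 0.5] are also illustrative choices.

```python
import numpy as np

def arma_activation(sigma2, b, a, n_freq):
    """Assumed ARMA power response sigma2 * |B(nu)|^2 / |A(nu)|^2.

    b holds the MA coefficients, a the AR coefficients (a[0] is usually 1).
    The response is sampled on n_freq normalized frequencies in [0, 0.5].
    """
    nu = np.linspace(0.0, 0.5, n_freq)
    q = np.arange(len(b))
    p = np.arange(len(a))
    B = np.exp(-2j * np.pi * np.outer(nu, q)) @ b  # MA polynomial on the unit circle
    A = np.exp(-2j * np.pi * np.outer(nu, p)) @ a  # AR polynomial on the unit circle
    return sigma2 * np.abs(B) ** 2 / np.abs(A) ** 2

def source_filter_model(W, sigma2, b, a):
    """Evaluate V_hat[f, t] = sum_r W[f, r] * h[r, t](f).

    W:      (n_freq, n_atoms) non-negative spectral atoms
    sigma2: (n_atoms, n_frames) non-negative global gains
    b, a:   (n_atoms, n_frames, order + 1) ARMA coefficients per atom and frame
    """
    n_freq, n_atoms = W.shape
    n_frames = sigma2.shape[1]
    V_hat = np.zeros((n_freq, n_frames))
    for r in range(n_atoms):
        for t in range(n_frames):
            h = arma_activation(sigma2[r, t], b[r, t], a[r, t], n_freq)
            V_hat[:, t] += W[:, r] * h
    return V_hat

# With order-0 filters (b = a = [1]) the activations no longer depend on
# frequency, so the model reduces to the standard NMF product W @ H.
rng = np.random.default_rng(0)
W = rng.random((16, 2))
sigma2 = rng.random((2, 5))
ones = np.ones((2, 5, 1))
V_hat = source_filter_model(W, sigma2, ones, ones)

# A first-order AR filter with a pole near nu = 0 boosts low frequencies.
h_lowpass = arma_activation(1.0, np.array([1.0]), np.array([1.0, -0.9]), 16)
```

In this sketch a single atom with frame-dependent filter coefficients can represent a spectrum whose envelope changes over time, which is the situation the abstract describes for a single event with constant pitch.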

Domains

Signal and image processing [eess.SP]

Main file

Hennequin-TASLP2011.pdf (1.22 MB)

Origin: Files produced by the author(s)

Contributor: Roland Badeau

https://inria.hal.science/hal-00945201

Submitted on: Tuesday, March 25, 2014, 09:17:55

Last modified on: Monday, October 9, 2023, 12:49:40

Long-term archiving on: Wednesday, June 25, 2014, 10:42:17

Dates and versions

hal-00945201 , version 1 (25-03-2014)

Identifiers

HAL Id : hal-00945201 , version 1

Cite

Romain Hennequin, Roland Badeau, Bertrand David. NMF with time-frequency activations to model non-stationary audio events. IEEE_J_ASLP, 2011, 19 (4), pp.744--753. ⟨hal-00945201⟩

Collections

INSTITUT-TELECOM CNRS PARISTECH LTCI IDS S2A

160 views

650 downloads