Fast dictionary learning for sparse representations of speech signals

Maria G. Jafari; Mark D. Plumbley

doi:10.1109/JSTSP.2011.2157892

Journal article, IEEE Journal of Selected Topics in Signal Processing, Year: 2011


Maria G. Jafari

Role: Author

Centre for Digital Music

Mark D. Plumbley

Role: Author
PersonId: 871792

Centre for Digital Music

Abstract

For dictionary-based decompositions of certain types, it has been observed that there might be a link between sparsity in the dictionary and sparsity in the decomposition. Sparsity in the dictionary has also been associated with the derivation of fast and efficient dictionary learning algorithms. Therefore, in this paper we present a greedy adaptive dictionary learning algorithm that sets out to find sparse atoms for speech signals. The algorithm learns the dictionary atoms on data frames taken from a speech signal. It iteratively extracts the data frame with minimum sparsity index, and adds this to the dictionary matrix. The contribution of this atom to the data frames is then removed, and the process is repeated. The algorithm is found to yield a sparse signal decomposition, supporting the hypothesis of a link between sparsity in the decomposition and dictionary. The algorithm is applied to the problem of speech representation and speech denoising, and its performance is compared to other existing methods. The method is shown to find dictionary atoms that are sparser than their time-domain waveform, and also to result in a sparser speech representation. In the presence of noise, the algorithm is found to have similar performance to the well-established principal component analysis.
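The greedy loop described in the abstract (pick the frame with the minimum sparsity index, add it to the dictionary, remove its contribution, repeat) can be illustrated with a minimal sketch. The abstract does not define the sparsity index, so the code assumes the ratio of the L1 norm to the L2 norm of each frame; normalising each atom to unit L2 norm and removing its contribution by orthogonal projection are likewise assumptions. The function names and the `frames` layout (one frame per column) are hypothetical and are not taken from the paper.

```python
import numpy as np


def sparsity_index(residual, eps=1e-12):
    """Per-column L1/L2 norm ratio (assumed definition); smaller means sparser."""
    l1 = np.abs(residual).sum(axis=0)
    l2 = np.linalg.norm(residual, axis=0)
    # Numerically empty columns get an infinite index so they are never selected.
    return np.where(l2 > eps, l1 / np.maximum(l2, eps), np.inf)


def greedy_adaptive_dictionary(frames, n_atoms, eps=1e-12):
    """Greedily build a dictionary from the columns of `frames` (one frame per column)."""
    residual = np.array(frames, dtype=float)  # working copy; the input is left untouched
    atoms = []
    for _ in range(n_atoms):
        xi = sparsity_index(residual, eps)
        k = int(np.argmin(xi))  # frame with the minimum sparsity index
        if not np.isfinite(xi[k]):
            break  # nothing left to extract
        # Assumption: the selected frame is scaled to unit L2 norm before being stored.
        atom = residual[:, k] / np.linalg.norm(residual[:, k])
        atoms.append(atom)
        # Assumption: the atom's contribution is removed by projecting it out of every frame.
        residual = residual - np.outer(atom, atom @ residual)
    if not atoms:
        return np.empty((residual.shape[0], 0))
    return np.column_stack(atoms)
```

Under these assumptions each new atom is taken from a residual that is already orthogonal to the earlier atoms, so the learned atoms are mutually orthogonal and the coefficients of a frame `x` are simply `D.T @ x` for a dictionary `D` returned by the sketch.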

Keywords

Sparse decomposition; adaptive dictionary; sparse dictionary; dictionary learning; speech analysis; speech denoising

Domains

Signal and Image Processing [eess.SP]

Main file

JafariPlumbley11-jstsp_accepted.pdf (815.13 KB)

Origin: Files produced by the author(s)

Contributor: Rémi Gribonval

https://inria.hal.science/inria-00599051

Submitted on: Friday, 11 September 2015, 12:12:01

Last modified on: Tuesday, 3 July 2018, 12:56:02

Long-term archiving on: Tuesday, 29 December 2015, 00:31:51

Dates and versions

inria-00599051 , version 1 (08-06-2011)

inria-00599051 , version 2 (11-09-2015)

Identifiers

HAL Id: inria-00599051, version 2
DOI: 10.1109/JSTSP.2011.2157892

Cite

Maria G. Jafari, Mark D. Plumbley. Fast dictionary learning for sparse representations of speech signals. IEEE Journal of Selected Topics in Signal Processing, 2011, 5 (5), pp.17. ⟨10.1109/JSTSP.2011.2157892⟩. ⟨inria-00599051v2⟩


260 views

1686 downloads
