Fast dictionary learning for sparse representations of speech signals

Abstract : For dictionary-based decompositions of certain types, it has been observed that there might be a link between sparsity in the dictionary and sparsity in the decomposition. Sparsity in the dictionary has also been associated with the derivation of fast and efficient dictionary learning algorithms. Therefore, in this paper we present a greedy adaptive dictionary learning algorithm that sets out to find sparse atoms for speech signals. The algorithm learns the dictionary atoms on data frames taken from a speech signal. It iteratively extracts the data frame with minimum sparsity index, and adds this to the dictionary matrix. The contribution of this atom to the data frames is then removed, and the process is repeated. The algorithm is found to yield a sparse signal decomposition, supporting the hypothesis of a link between sparsity in the decomposition and dictionary. The algorithm is applied to the problem of speech representation and speech denoising, and its performance is compared to other existing methods. The method is shown to find dictionary atoms that are sparser than their time-domain waveform, and also to result in a sparser speech representation. In the presence of noise, the algorithm is found to have similar performance to the well established principal component analysis.
Type de document :
Article dans une revue
IEEE journal of selected topics in Signal Processing, special issue on Adaptive Sparse Representation of Data and Applications in Signal and Image Processing., IEEE Signal Processing Society, 2011, 5 (5), pp.17. 〈10.1109/JSTSP.2011.2157892〉
Liste complète des métadonnées

Littérature citée [26 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00599051
Contributeur : Rémi Gribonval <>
Soumis le : vendredi 11 septembre 2015 - 12:12:01
Dernière modification le : jeudi 8 octobre 2015 - 01:04:31
Document(s) archivé(s) le : mardi 29 décembre 2015 - 00:31:51

Fichier

JafariPlumbley11-jstsp_accepte...
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Maria Jafari, Mark D. Plumbley. Fast dictionary learning for sparse representations of speech signals. IEEE journal of selected topics in Signal Processing, special issue on Adaptive Sparse Representation of Data and Applications in Signal and Image Processing., IEEE Signal Processing Society, 2011, 5 (5), pp.17. 〈10.1109/JSTSP.2011.2157892〉. 〈inria-00599051v2〉

Partager

Métriques

Consultations de la notice

133

Téléchargements de fichiers

535