Fast dictionary learning for sparse representations of speech signals - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue IEEE Journal of Selected Topics in Signal Processing Année : 2011

Fast dictionary learning for sparse representations of speech signals

Mark D. Plumbley
  • Fonction : Auteur
  • PersonId : 871792

Résumé

For dictionary-based decompositions of certain types, it has been observed that there might be a link between sparsity in the dictionary and sparsity in the decomposition. Sparsity in the dictionary has also been associated with the derivation of fast and efficient dictionary learning algorithms. Therefore, in this paper we present a greedy adaptive dictionary learning algorithm that sets out to find sparse atoms for speech signals. The algorithm learns the dictionary atoms on data frames taken from a speech signal. It iteratively extracts the data frame with minimum sparsity index, and adds this to the dictionary matrix. The contribution of this atom to the data frames is then removed, and the process is repeated. The algorithm is found to yield a sparse signal decomposition, supporting the hypothesis of a link between sparsity in the decomposition and dictionary. The algorithm is applied to the problem of speech representation and speech denoising, and its performance is compared to other existing methods. The method is shown to find dictionary atoms that are sparser than their time-domain waveform, and also to result in a sparser speech representation. In the presence of noise, the algorithm is found to have similar performance to the well established principal component analysis.
Fichier principal
Vignette du fichier
JafariPlumbley11-jstsp_accepted.pdf (815.13 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00599051 , version 1 (08-06-2011)
inria-00599051 , version 2 (11-09-2015)

Identifiants

Citer

Maria G. Jafari, Mark D. Plumbley. Fast dictionary learning for sparse representations of speech signals. IEEE Journal of Selected Topics in Signal Processing, 2011, 5 (5), pp.17. ⟨10.1109/JSTSP.2011.2157892⟩. ⟨inria-00599051v2⟩
260 Consultations
1686 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More