An efficient solution to sparse linear prediction analysis of speech - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue EURASIP Journal on Audio, Speech, and Music Processing Année : 2013

An efficient solution to sparse linear prediction analysis of speech

Vahid Khanagha
  • Fonction : Auteur correspondant
  • PersonId : 865238

Connectez-vous pour contacter l'auteur
Khalid Daoudi

Résumé

We propose an efficient solution to the problem of sparse linear prediction analysis of the speech signal. Our method is based on minimization of a weighted l 2-norm of the prediction error. The weighting function is constructed such that less emphasis is given to the error around the points where we expect the largest prediction errors to occur (the glottal closure instants) and hence the resulting cost function approaches the ideal l 0-norm cost function for sparse residual recovery. We show that the efficient minimization of this objective function (by solving normal equations of linear least squares problem) provides enhanced sparsity level of residuals compared to the l 1-norm minimization approach which uses the computationally demanding convex optimization methods. Indeed, the computational complexity of the proposed method is roughly the same as the classic minimum variance linear prediction analysis approach. Moreover, to show a potential application of such sparse representation, we use the resulting linear prediction coefficients inside a multi-pulse synthesizer and show that the corresponding multi-pulse estimate of the excitation source results in slightly better synthesis quality when compared to the classical technique which uses the traditional non-sparse minimum variance synthesizer.
Fichier principal
Vignette du fichier
1687-4722-2013-3.pdf (703.04 Ko) Télécharger le fichier
1687-4722-2013-3.xml (69.09 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Format : Autre
Loading...

Dates et versions

hal-00805054 , version 1 (26-03-2013)

Identifiants

  • HAL Id : hal-00805054 , version 1

Citer

Vahid Khanagha, Khalid Daoudi. An efficient solution to sparse linear prediction analysis of speech. EURASIP Journal on Audio, Speech, and Music Processing, 2013, 2013 (1), pp.3. ⟨hal-00805054⟩
29 Consultations
173 Téléchargements

Partager

Gmail Facebook X LinkedIn More