An Efficient Solution to Sparse Linear Prediction Analysis of Speech

Vahid Khanagha; Khalid Daoudi

doi:10.1186/1687-4722-2013-3

Journal article. EURASIP Journal on Audio, Speech, and Music Processing, 2013

Vahid Khanagha, Geometry and Statistics in acquisition data

Khalid Daoudi (ORCID: 0000-0003-3536-1060), Geometry and Statistics in acquisition data

Abstract

We propose an efficient closed-form solution to the problem of sparse linear prediction analysis of the speech signal. Our method minimizes a weighted l2-norm of the prediction error. The weighting function is constructed so that less emphasis is given to the error around the points where we expect the largest prediction errors to occur (the glottal closure instants); the resulting cost function thereby approaches the ideal l0-norm cost function for sparse residual recovery. We show that minimizing this mathematically tractable objective function, by solving the normal equations of a linear least-squares problem, yields sparser residuals than the l1-norm minimization approach, which relies on computationally demanding convex optimization methods. Indeed, the computational complexity of the proposed method is roughly the same as that of the classical minimum variance linear prediction analysis. Moreover, to show a potential application of such a sparse representation, we use the resulting linear prediction coefficients inside a multi-pulse coder and show that it achieves better coding quality than the classical Multi-pulse Excitation coder, which uses the traditional minimum variance synthesizer.
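
As a rough illustration of the idea in the abstract, the sketch below fits linear prediction coefficients by minimizing a weighted squared prediction error through the normal equations. It is not the authors' implementation: the function names (`weighted_lpc`, `gci_weights`, `synth_voiced`), the Gaussian-shaped weighting, the `width` parameter, and the assumption that the glottal closure instants are already known are all hypothetical choices made for this example. The abstract only states that the weights de-emphasize the error around the glottal closure instants.

```python
import numpy as np

def weighted_lpc(x, order, weights):
    """Minimize sum_n w[n] * (x[n] - sum_k a[k] * x[n-k])**2 in closed form."""
    x = np.asarray(x, dtype=float)
    w = np.asarray(weights, dtype=float)
    n = np.arange(order, len(x))
    # Row for sample n holds the past samples x[n-1], ..., x[n-order].
    X = np.column_stack([x[n - k] for k in range(1, order + 1)])
    y = x[n]
    Xw = X * w[n, None]
    # Normal equations of weighted least squares: (X^T W X) a = X^T W y.
    a = np.linalg.solve(X.T @ Xw, Xw.T @ y)
    return a, y - X @ a

def gci_weights(length, gci_positions, width):
    """Assumed weighting (not from the paper): 1 minus Gaussian bumps at the GCIs."""
    n = np.arange(length)
    w = np.ones(length)
    for g in gci_positions:
        w -= np.exp(-0.5 * ((n - g) / width) ** 2)
    # Keep the weights strictly positive so the normal equations stay well posed.
    return np.clip(w, 1e-3, 1.0)

def synth_voiced(a_true, length, period, first):
    """Toy voiced signal: an all-pole filter driven by a unit impulse train."""
    x = np.zeros(length)
    for n in range(length):
        value = 1.0 if n >= first and (n - first) % period == 0 else 0.0
        for k, ak in enumerate(a_true, start=1):
            if n - k >= 0:
                value += ak * x[n - k]
        x[n] = value
    return x

# Usage on synthetic data with impulses (stand-ins for GCIs) every 20 samples.
x = synth_voiced([1.8, -0.9], length=400, period=20, first=10)
gcis = np.arange(10, 400, 20)
w = gci_weights(len(x), gcis, width=2.0)
a, residual = weighted_lpc(x, order=2, weights=w)
print("estimated coefficients:", a)
print("residual samples above 0.1:", int(np.sum(np.abs(residual) > 0.1)))
```

Because the only cost beyond ordinary least-squares linear prediction is the per-sample multiplication by the weights, the sketch is consistent with the abstract's remark that the complexity stays close to that of classical minimum variance analysis. In practice the glottal closure instants would have to be estimated from the speech signal, a step this example leaves out.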

Domains

Signal and Image Processing

https://inria.hal.science/hal-00709168

Submitted on: Monday, June 18, 2012, 9:52:39 AM

Last modification on: Friday, February 2, 2024, 3:34:19 AM

Dates and versions

hal-00709168 , version 1 (18-06-2012)

Identifiers

HAL Id : hal-00709168 , version 1
DOI : 10.1186/1687-4722-2013-3

Cite

Vahid Khanagha, Khalid Daoudi. An Efficient Solution to Sparse Linear Prediction Analysis of Speech. EURASIP Journal on Audio, Speech, and Music Processing, 2013, 3, ⟨10.1186/1687-4722-2013-3⟩. ⟨hal-00709168⟩

Collections

UNIV-RENNES1 INRIA IRISA INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM