Skip to Main content Skip to Navigation
Journal articles

Reducing over- and under-estimation of the a priori SNR in speech enhancement techniques

Abstract : Most speech enhancement methods based on short-time spectral modification are generally expressed as a spectral gain depending on the estimate of the local signal-to-noise ratio (SNR) on each frequency bin. Several studies have analyzed the performance of a priori SNR estimation algorithms to improve speech quality and to reduce speech distortions. In this paper, we concentrate on the analysis of over- and under estimation of the a priori SNR in speech enhancement and noise reduction systems. We first show that conventional approaches such as the decision-directed approach proposed by Ephraïm and Malah lead to a biased estimator for the a priori SNR. To reduce this bias, our strategy relies on the introduction of a correction term in the a priori SNR estimate depending on the current state of both the available a posteriori SNR and the estimated a priori one. The proposed solution leads to a bias-compensated a priori SNR estimate, and allows to finely estimating the output speech signal to be very close to the original one on each frequency bin. Such refinement procedure in the a priori SNR estimate can be inserted in any type of spectral gain function to improve the output speech quality. Objective tests under various environments in terms of the Normalized Covariance Metric (NCM) criterion, the Coherence Speech Intelligibility Index (CSII) criterion, the segmental SNR criterion and the Perceptual Evaluation of Speech Quality (PESQ) measure are presented showing the superiority of the proposed method compared to competitive algorithms.
Document type :
Journal articles
Complete list of metadata
Contributor : Pascal Scalart Connect in order to contact the contributor
Submitted on : Tuesday, January 6, 2015 - 10:54:15 AM
Last modification on : Tuesday, October 19, 2021 - 11:58:52 PM



Mohamed Djendi, Pascal Scalart. Reducing over- and under-estimation of the a priori SNR in speech enhancement techniques. Digital Signal Processing, Elsevier, 2014, pp.12. ⟨10.1016/j.dsp.2014.05.007⟩. ⟨hal-01100254⟩



Record views