Reducing over- and under-estimation of the a priori SNR in speech enhancement techniques

Abstract: Most speech enhancement methods based on short-time spectral modification are expressed as a spectral gain that depends on an estimate of the local signal-to-noise ratio (SNR) in each frequency bin. Several studies have analyzed the performance of a priori SNR estimation algorithms with the aim of improving speech quality and reducing speech distortion. In this paper, we concentrate on the analysis of over- and under-estimation of the a priori SNR in speech enhancement and noise reduction systems. We first show that conventional approaches, such as the decision-directed approach proposed by Ephraim and Malah, lead to a biased estimator of the a priori SNR. To reduce this bias, our strategy introduces a correction term into the a priori SNR estimate; this term depends on the current state of both the available a posteriori SNR and the estimated a priori SNR. The proposed solution yields a bias-compensated a priori SNR estimate and allows the output speech signal to be estimated finely, so that it stays very close to the original signal in each frequency bin. Such a refinement procedure in the a priori SNR estimate can be inserted into any type of spectral gain function to improve the output speech quality. Objective tests under various environments are presented in terms of the Normalized Covariance Metric (NCM), the Coherence Speech Intelligibility Index (CSII), the segmental SNR and the Perceptual Evaluation of Speech Quality (PESQ) measure; they show the superiority of the proposed method over competing algorithms.
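
For readers unfamiliar with the baseline that the abstract calls biased, the following is a minimal sketch of the standard decision-directed a priori SNR estimate of Ephraim and Malah for a single frequency bin, combined with a Wiener gain. It is not the bias-compensated estimator of the paper, whose correction term is not given in the abstract. The function name, the smoothing factor alpha = 0.98, the floor xi_min, the choice of the Wiener gain and the assumption of a known, constant noise variance are all illustrative assumptions.

```python
def decision_directed_snr(noisy_power, noise_power, alpha=0.98, xi_min=1e-3):
    """Track the a priori SNR of one frequency bin across frames.

    noisy_power: sequence of |Y|^2 values, one per frame (hypothetical input)
    noise_power: noise variance for this bin, assumed known and constant
    alpha, xi_min: illustrative smoothing factor and SNR floor (assumptions)
    Returns two lists: a priori SNR estimates and Wiener gains, one per frame.
    """
    xi_estimates = []
    gains = []
    prev_clean_power = 0.0  # |S_hat|^2 from the previous frame
    for y2 in noisy_power:
        gamma = y2 / noise_power             # a posteriori SNR
        ml_term = max(gamma - 1.0, 0.0)      # instantaneous SNR estimate
        # Decision-directed rule: weighted sum of the previous clean-speech
        # estimate and the current instantaneous estimate.
        xi = alpha * prev_clean_power / noise_power + (1.0 - alpha) * ml_term
        xi = max(xi, xi_min)
        gain = xi / (1.0 + xi)               # Wiener spectral gain
        prev_clean_power = (gain ** 2) * y2  # power of the enhanced bin
        xi_estimates.append(xi)
        gains.append(gain)
    return xi_estimates, gains


if __name__ == "__main__":
    # Two noise-only frames followed by a speech onset (synthetic values).
    xi, g = decision_directed_snr([1.0, 1.0, 9.0, 9.0, 9.0], noise_power=1.0)
    for frame, (x, gain) in enumerate(zip(xi, g)):
        print(frame, round(x, 4), round(gain, 4))
```

In this synthetic example the instantaneous estimate jumps to 8 at the speech onset, while the decision-directed estimate only reaches about 0.16 in that frame and then climbs slowly. This lag is one simple illustration of the under-estimation that the abstract refers to.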
Document type:
Journal article
Digital Signal Processing, Elsevier, 2014, pp.12. 〈10.1016/j.dsp.2014.05.007〉

https://hal.inria.fr/hal-01100254
Contributor: Pascal Scalart
Submitted on: Tuesday, January 6, 2015 - 10:54:15
Last modified on: Wednesday, May 16, 2018 - 11:23:27

Citation

Mohamed Djendi, Pascal Scalart. Reducing over- and under-estimation of the a priori SNR in speech enhancement techniques. Digital Signal Processing, Elsevier, 2014, pp.12. 〈10.1016/j.dsp.2014.05.007〉. 〈hal-01100254〉
