Journal article. Digital Signal Processing. Year: 2014

Reducing over- and under-estimation of the a priori SNR in speech enhancement techniques

Abstract

Most speech enhancement methods based on short-time spectral modification are expressed as a spectral gain that depends on the estimate of the local signal-to-noise ratio (SNR) in each frequency bin. Several studies have analyzed the performance of a priori SNR estimation algorithms with the aim of improving speech quality and reducing speech distortion. In this paper, we concentrate on the analysis of over- and under-estimation of the a priori SNR in speech enhancement and noise reduction systems. We first show that conventional approaches, such as the decision-directed approach proposed by Ephraim and Malah, lead to a biased estimator of the a priori SNR. To reduce this bias, our strategy introduces a correction term into the a priori SNR estimate that depends on the current state of both the available a posteriori SNR and the estimated a priori SNR. The proposed solution yields a bias-compensated a priori SNR estimate and allows the output speech signal to be estimated finely, so that it is very close to the original signal in each frequency bin. Such a refinement of the a priori SNR estimate can be inserted into any type of spectral gain function to improve the output speech quality. Objective tests under various environments are presented in terms of the Normalized Covariance Metric (NCM), the Coherence Speech Intelligibility Index (CSII), the segmental SNR and the Perceptual Evaluation of Speech Quality (PESQ) measure; they show the superiority of the proposed method over competing algorithms.
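For readers unfamiliar with the baseline named in the abstract, the following is a minimal sketch of the conventional decision-directed a priori SNR estimate for a single frequency bin, paired with a Wiener-type gain. It is not the authors' bias-compensated estimator: the correction term is not given in the abstract and is not reproduced here. The function name, the smoothing factor `alpha=0.98`, the floor `xi_min`, the first-frame initialization and the assumption of a known, stationary noise power are all illustrative choices rather than values taken from the paper.

```python
def decision_directed_snr(noisy_power, noise_power, alpha=0.98, xi_min=1e-3):
    """Decision-directed a priori SNR tracking for one frequency bin.

    noisy_power : sequence of |Y(l)|^2 values, one per frame l
    noise_power : noise variance estimate (assumed known and stationary here)
    alpha       : smoothing factor (illustrative default)
    xi_min      : lower bound on the a priori SNR (illustrative default)

    Returns (xi_track, gain_track), one entry per frame.
    """
    xi_track = []
    gain_track = []
    prev_clean_power = 0.0  # |S_hat(l-1)|^2 from the previous frame

    for frame_index, y2 in enumerate(noisy_power):
        gamma = y2 / noise_power              # a posteriori SNR
        instantaneous = max(gamma - 1.0, 0.0)  # half-wave rectified (gamma - 1)

        if frame_index == 0:
            # No previous frame: fall back to the instantaneous estimate.
            xi = instantaneous
        else:
            # Weighted mix of the previous-frame estimate and the
            # instantaneous estimate from the current frame.
            xi = alpha * prev_clean_power / noise_power \
                + (1.0 - alpha) * instantaneous

        xi = max(xi, xi_min)
        gain = xi / (1.0 + xi)                 # Wiener-type spectral gain

        # Estimated clean-speech power, reused in the next frame.
        prev_clean_power = (gain ** 2) * y2

        xi_track.append(xi)
        gain_track.append(gain)

    return xi_track, gain_track


if __name__ == "__main__":
    # Two noise-only frames followed by a speech onset (synthetic values).
    xi, gain = decision_directed_snr([1.0, 1.0, 9.0, 9.0], noise_power=1.0)
    for l, (x, g) in enumerate(zip(xi, gain)):
        print(f"frame {l}: xi={x:.4f} gain={g:.4f}")
```

With the synthetic input in the usage example, the estimate at the onset frames stays well below the instantaneous value of 8 because the recursion leans heavily on the previous frame. This kind of lag is one way an a priori SNR estimate can end up under-estimated; the paper's own bias analysis should be consulted for the precise statement.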

Dates and versions

hal-01100254 , version 1 (06-01-2015)

Cite

Mohamed Djendi, Pascal Scalart. Reducing over- and under-estimation of the a priori SNR in speech enhancement techniques. Digital Signal Processing, 2014, 32, pp.12. ⟨10.1016/j.dsp.2014.05.007⟩. ⟨hal-01100254⟩