Voice Activity Detection Based on Statistical Likelihood Ratio With Adaptive Thresholding - Archive ouverte HAL Access content directly
Conference Papers Year :

Voice Activity Detection Based on Statistical Likelihood Ratio With Adaptive Thresholding

(1) , (1) , (1, 2) , (3)
1
2
3

Abstract

Statistical likelihood ratio test is a widely used voice activity detection (VAD) method, in which the likelihood ratio of the current temporal frame is compared with a threshold. A fixed threshold is always used, but this is not suitable for various types of noise. In this paper, an adaptive threshold is proposed as a function of the local statistics of the likelihood ratio. This threshold represents the upper bound of the likelihood ratio for the non-speech frames, whereas it remains generally lower than the likelihood ratio for the speech frames. As a result, a high non-speech hit rate can be achieved, while maintaining speech hit rate as large as possible.
Fichier principal
Vignette du fichier
vad_slr.pdf (426.9 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-01349776 , version 1 (28-07-2016)

Identifiers

Cite

Xiaofei Li, Radu Horaud, Laurent Girin, Sharon Gannot. Voice Activity Detection Based on Statistical Likelihood Ratio With Adaptive Thresholding. IWAENC 2016 - International Workshop on Acoustic Signal Enhancement (IWAENC), Sep 2016, Xi'an, China. pp.1-5, ⟨10.1109/IWAENC.2016.7602911⟩. ⟨hal-01349776⟩
606 View
1063 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More