T. Kinnunen, E. Chernenko, M. Tuononen, P. Fränti, and H. Li, Voice activity detection using MFCC features and support vector machine, Int. Conf. on Speech and Computer (SPECOM07), pp.556-561, 2007.

S. Mousazadeh and I. Cohen, Voice Activity Detection in Presence of Transient Noise Using Spectral Clustering, IEEE Transactions on Audio, Speech, and Language Processing, vol.21, issue.6, pp.1261-1271, 2013.
DOI : 10.1109/TASL.2013.2248717

D. Dov, R. Talmon, and I. Cohen, Audio-Visual Voice Activity Detection Using Diffusion Maps, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, issue.4, pp.732-745, 2015.
DOI : 10.1109/TASLP.2015.2405481

F. Germain, L. Dennis, . Sun, J. Gautham, and . Mysore, Speaker and noise independent voice activity detection, INTERSPEECH, pp.732-736, 2013.

A. Prasanta-kumar-ghosh, S. Tsiartas, and . Narayanan, Robust Voice Activity Detection Using Long-Term Signal Variability, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.3, pp.600-613, 2011.
DOI : 10.1109/TASL.2010.2052803

Y. Ma and A. Nishihara, Efficient voice activity detection algorithm using long-term spectral flatness measure, EURASIP Journal on Audio, Speech, and Music Processing, vol.2013, issue.1, pp.1-18, 2013.

J. Ramírez, C. José, C. Segura, A. D. Benítez, L. Torre et al., Efficient voice activity detection algorithms using long-term speech information, Speech Communication, vol.42, issue.3-4, pp.271-287, 2004.
DOI : 10.1016/j.specom.2003.10.002

D. Ying, Y. Yan, J. Dang, K. Frank, and . Soong, Voice Activity Detection Based on an Unsupervised Learning Framework, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.8, pp.2624-2633, 2011.
DOI : 10.1109/TASL.2011.2125953

J. Sohn, N. S. Kim, and W. Sung, A statistical model-based voice activity detection, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.1-3, 1999.
DOI : 10.1109/97.736233

C. Richard, R. Hendriks, J. Heusdens, and . Jensen, MMSE based noise psd tracking with low complexity, IEEE International Conference on Acoustics Speech and Signal Processing, pp.4266-4269, 2010.

T. Gerkmann, C. Richard, and . Hendriks, Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.4, pp.1383-1393, 2012.
DOI : 10.1109/TASL.2011.2180896

X. Li, L. Girin, S. Gannot, and R. Horaud, Non-stationary noise power spectral density estimation based on regional statistics, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016.
DOI : 10.1109/ICASSP.2016.7471661
URL : https://hal.archives-ouvertes.fr/hal-01250892

R. Martin, Noise power spectral density estimation based on optimal smoothing and minimum statistics, IEEE Transactions on Speech and Audio Processing, vol.9, issue.5, pp.504-512, 2001.
DOI : 10.1109/89.928915

Y. Ephraim and D. Malah, Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.32, issue.6, pp.1109-1121, 1984.
DOI : 10.1109/TASSP.1984.1164453

J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett et al., Getting started with the DARPA TIMIT CD-ROM: An acoustic phonetic continuous speech database, National Institute of Standards and Technology (NIST), vol.107, 1988.

A. Varga, J. Herman, and . Steeneken, Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems, Speech Communication, vol.12, issue.3, pp.247-251, 1993.
DOI : 10.1016/0167-6393(93)90095-3