X. Anguera, C. Wooters, and J. Hernando, Acoustic beamforming for speaker diarization of meetings, IEEE Transactions on Audio, Speech, and Language Processing, vol.15, issue.7, pp.2011-2022, 2007.

D. Baby, T. Virtanen, and H. Van-hamme, Coupled dictionary-based speech enhancement for CHiME-3 challenge, 2015.

D. Bagchi, M. I. Mandel, Z. Wang, Y. He, A. Plummer et al., Combining spectral feature mapping and multi-channel model-based source separation for noise-robust automatic speech recognition, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.496-503, 2015.

H. Barfuss, C. Huemmer, A. Schwarz, and W. Kellermann, Robust coherence-based spectral enhancement for distant speech recognition, 2015.

J. Barker, R. Marxer, E. Vincent, and S. Watanabe, The third 'CHiME' speech separation and recognition challenge: Dataset, task and baselines, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.504-511, 2015.
DOI : 10.1109/ASRU.2015.7404837
URL : https://hal.archives-ouvertes.fr/hal-01211376

J. Barker, E. Vincent, N. Ma, H. Christensen, and P. Green, The PASCAL CHiME speech separation and recognition challenge, Computer Speech & Language, vol.27, issue.3, pp.621-633, 2013.
DOI : 10.1016/j.csl.2012.10.004
URL : https://hal.archives-ouvertes.fr/hal-00646370

A. M. Castro Martinez and B. T. Meyer, Mutual benefits of auditory spectro-temporal Gabor features and deep learning for the 3rd CHiME challenge, 2015.

J. H. Dibiase, H. F. Silverman, and M. S. Brandstein, Robust localization in reverberant rooms, Microphone Arrays: Techniques and Applications, Springer-Verlag, pp.157-180, 2001.

J. Du, Q. Wang, Y. Tu, X. Bao, L. Dai et al., An information fusion approach to recognizing microphone array speech in the CHiME-3 challenge based on a deep learning framework, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.430-435, 2015.
DOI : 10.1109/ASRU.2015.7404827

A. El-desoky-mousa, E. Marchi, and B. Schuller, The ICSTM+TUM+UP approach to the 3rd CHiME challenge: Single-channel LSTM speech enhancement with multi-channel correlation shaping dereverberation and LSTM language models, 2015.

H. Fletcher and W. A. Munson, Loudness, its definition, measurement and calculation, Journal of the Acoustical Society of America, vol.5, issue.2, pp.82-108, 1933.

M. Frigge, D. C. Hoaglin, and B. Iglewicz, Some implementations of the boxplot, The American Statistician, vol.43, issue.1, pp.50-54, 1989.

Y. Fujita, R. Takashima, T. Homma, R. Ikeshita, Y. Kawaguchi et al., Unified ASR system using LGM-based source separation, noise-robust feature extraction, and word hypothesis selection, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.416-422, 2015.
DOI : 10.1109/ASRU.2015.7404825

M. Hasegawa-johnson and M. Fleck, The International Speech LEXicon, 2007.

H. Hermansky and N. Morgan, RASTA processing of speech, IEEE Transactions on Speech and Audio Processing, vol.2, issue.4, pp.578-589, 1994.

J. Heymann, L. Drude, A. Chinaev, and R. Haeb-umbach, BLSTM supported GEV beamformer front-end for the 3RD CHiME challenge, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.444-451, 2015.
DOI : 10.1109/ASRU.2015.7404829

H. Hirsch and D. Pearce, The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions, Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP), pp.29-32, 2000.

T. Hori, Z. Chen, H. Erdogan, J. R. Hershey, J. Le-roux et al., The MERL/SRI system for the 3RD CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.475-481, 2015.
DOI : 10.1109/ASRU.2015.7404833

S. Jalalvand, D. Falavigna, M. Matassoni, P. Svaizer, and M. Omologo, Boosted acoustic model learning and hypotheses rescoring on the CHiME3 task, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.409-415, 2015.

C. Kim and R. M. Stern, Power-normalized cepstral coefficients (PNCC) for robust speech recognition, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4101-4104, 2012.

B. Loesch and B. Yang, Adaptive Segmentation and Separation of Determined Convolutive Mixtures under Dynamic Conditions, Proceedings of the 9th International Conference on Latent Variable Analysis and Signal Separation, pp.41-48, 2010.
DOI : 10.1007/978-3-642-15995-4_6

N. Ma, R. Marxer, J. Barker, and G. J. Brown, Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.490-495, 2015.
DOI : 10.1109/ASRU.2015.7404835

X. Mestre and M. A. Lagunas, On diagonal loading for minimum variance beamformers, Proceedings of the 3rd IEEE International Symposium on Signal Processing and Information Technology (IEEE Cat. No.03EX795), pp.459-462, 2003.
DOI : 10.1109/ISSPIT.2003.1341157

T. Mikolov, M. Karafiát, L. Burget, J. Černocký, and S. Khudanpur, Recurrent neural network based language model, Proceedings of the 11th Annual Conference of the International Speech Communication Association, pp.1045-1048, 2010.

A. Misbullah and J. Chien, Deep feedforward and recurrent neural networks for speech recognition, unpublished.

N. Moritz, S. Gerlach, K. Adiloglu, J. Anemüller, B. Kollmeier et al., A CHiME-3 challenge system: Long-term acoustic features for noise robust automatic speech recognition, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.468-474, 2015.
DOI : 10.1109/ASRU.2015.7404832

D. Mostefa, N. Moreau, K. Choukri, G. Potamianos, S. M. Chu et al., The CHIL audiovisual corpus for lecture and meeting analysis inside smart rooms, Language Resources and Evaluation, vol.41, issue.3-4, pp.389-407, 2007.
DOI : 10.1007/s10579-007-9054-4

Z. Pang and F. Zhu, Noise-robust ASR for the third 'CHiME' challenge exploiting time-frequency masking based multi-channel speech enhancement and recurrent neural network, 2015.

N. Parihar, J. Picone, D. Pearce, and H. G. Hirsch, Performance analysis of the Aurora large vocabulary baseline system, Proceedings of the, 2004.

L. Pfeifenberger, T. Schrank, M. Zöhrer, M. Hagmüller, and F. Pernkopf, Multi-channel speech processing architectures for noise robust speech recognition: 3rd CHiME challenge results, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.452-459, 2015.
DOI : 10.1109/ASRU.2015.7404830

D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek et al., The Kaldi Speech Recognition Toolkit, IEEE 2011 Workshop on Automatic Speech Recognition and Understanding. IEEE Signal Processing Society, p.11, 2011.

A. Prudnikov, M. Korenevsky, and S. Aleinik, Adaptive beamforming and adaptive training of DNN acoustic models for enhanced multichannel noisy speech recognition, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.401-408, 2015.
DOI : 10.1109/ASRU.2015.7404823

S. Renals, T. Hain, and H. Bourlard, Interpretation of Multiparty Meetings: the AMI and AMIDA Projects, 2008 Hands-Free Speech Communication and Microphone Arrays, pp.115-118, 2008.
DOI : 10.1109/HSCMA.2008.4538700

S. Sivasankaran, A. A. Nugraha, E. Vincent, J. A. Morales-cordovilla, S. Dalmia et al., Robust ASR using neural network based speech enhancement and feature simulation, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.482-489, 2015.
DOI : 10.1109/ASRU.2015.7404834
URL : https://hal.archives-ouvertes.fr/hal-01204553

C. H. Taal, R. C. Hendriks, R. Heusdens, and J. Jensen, An algorithm for intelligibility prediction of time-frequency weighted noisy speech, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.7, pp.2125-2136, 2011.

Y. Tachioka, H. Kanagawa, and J. Ishii, The overview of the MELCO ASR system for the third CHiME challenge, 2015.

J. Taghia and R. Martin, Objective intelligibility measures based on mutual information for speech subjected to speech enhancement processing, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.1, pp.6-16, 2014.

H. D. Tran, J. Dennis, and L. Yiren, A comparative study of multichannel processing methods for noisy automatic speech recognition on the third CHiME challenge, unpublished.

K. Veselý, A. Ghoshal, L. Burget, and D. Povey, Sequence-discriminative training of deep neural networks, Proceedings of the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), pp.2345-2349, 2013.

E. Vincent, J. Barker, S. Watanabe, J. Le-roux, F. Nesta et al., The second ‘CHiME’ speech separation and recognition challenge: An overview of challenge systems and outcomes, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, pp.162-167, 2013.
DOI : 10.1109/ASRU.2013.6707723

E. Vincent, J. Barker, S. Watanabe, J. Le-roux, F. Nesta et al., The second ‘CHiME’ speech separation and recognition challenge: Datasets, tasks and baselines, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.126-130, 2013.
DOI : 10.1109/ICASSP.2013.6637622

E. Vincent, S. Watanabe, A. Nugraha, J. Barker, and R. Marxer, An analysis of environment, microphone and data simulation mismatches in robust speech recognition, Computer Speech & Language
DOI : 10.1016/j.csl.2016.11.005
URL : https://hal.archives-ouvertes.fr/hal-01399180

T. T. Vu, B. Bigot, and E. S. Chng, Speech enhancement using beamforming and non negative matrix factorization for robust speech recognition in the CHiME-3 challenge, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.423-429, 2015.
DOI : 10.1109/ASRU.2015.7404826

X. Wang, C. Wu, P. Zhang, Z. Wang, Y. Liu et al., Noise robust IOA/CAS speech separation and recognition system for the third 'CHiME' challenge, 2015.

T. Yoshioka, N. Ito, M. Delcroix, A. Ogawa, K. Kinoshita et al., The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.436-443, 2015.
DOI : 10.1109/ASRU.2015.7404828

S. Zhao, X. Xiao, Z. Zhang, T. N. Nguyen, X. Zhong et al., Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.460-467, 2015.
DOI : 10.1109/ASRU.2015.7404831

Y. Zhuang, Y. You, T. Tan, M. Bi, S. Bu et al., System combination for multi-channel noise robust ASR, 2015.