J. Benesty, J. Chen, Y. Huang, and B. Rafaely, Microphone array signal process, p.435

X. Anguera, C. Wooters, and J. Hernando, Acoustic Beamforming for Speaker Diarization of Meetings, Speech, and Lan- 440 guage Processing, pp.2011-2022, 2007.
DOI : 10.1109/TASL.2007.902460
URL : http://www.xavieranguera.com/papers/transactions_taslp_2007.pdf

K. Kumatani, J. Mcdonough, and B. Raj, Microphone Array Processing for Distant Speech Recognition: From Close-Talking Microphones to Far-Field Sensors, IEEE Signal Processing Magazine, vol.29, issue.6, pp.127-140, 2012.
DOI : 10.1109/MSP.2012.2205285
URL : http://www.lsv.uni-saarland.de/personalPages/kkumatani/pubdata/apsipa2012b.pdf

K. Kinoshita, M. Delcroix, T. Yoshioka, T. Nakatani, A. Sehr et al., The reverb challenge: A common evaluation framework for dereverberation and recognition of reverberant speech, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp.1-4, 2013.
DOI : 10.1109/WASPAA.2013.6701894

J. Barker, R. Marxer, E. Vincent, and S. Watanabe, The third CHiME speech 450 separation and recognition challenge: Dataset, task and baselines, IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.504-511, 2015.
DOI : 10.1109/asru.2015.7404837

S. Gannot, D. Burshtein, and E. Weinstein, Signal enhancement using beamforming and nonstationarity with applications to speech, IEEE Transactions on Signal Processing, vol.49, issue.8, pp.1614-1626, 2001.
DOI : 10.1109/78.934132
URL : http://sipl.technion.ac.il/new/Pictures/Teaching/Projects/2002-3/MicArray-Gannot.pdf

T. Yoshioka and T. Nakatani, Generalization of Multi-Channel Linear Prediction Methods for Blind MIMO Impulse Response Shortening, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.10, pp.2707-2720, 2012.
DOI : 10.1109/TASL.2012.2210879

T. Van-den-bogaert, S. Doclo, J. Wouters, and M. Moonen, Speech enhancement with multichannel Wiener filter techniques in multimicrophone binaural hearing aids, The Journal of the Acoustical Society of America, vol.125, issue.1, pp.360-371, 2009.
DOI : 10.1121/1.3023069

X. Xiao, S. Watanabe, H. Erdogan, L. Lu, J. Hershey et al., Deep beamforming networks for 465 multi-channel speech recognition, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.2016-5745
DOI : 10.1109/icassp.2016.7472778

B. Li, T. N. Sainath, R. J. Weiss, K. W. Wilson, and M. Bacchiani, Neural Network Adaptive Beamforming for Robust Multichannel Speech Recognition, Interspeech 2016, pp.1976-1980, 2016.
DOI : 10.21437/Interspeech.2016-173

J. Heymann, L. Drude, and R. Haeb-umbach, Neural network based spectral mask estimation for acoustic beamforming, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.2016-196
DOI : 10.1109/ICASSP.2016.7471664

S. Dalmia, I. Illina, and A. Liutkus, Robust ASR using neural network based speech enhancement and feature simulation, IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.2015-482, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01204553

A. A. Nugraha, A. Liutkus, and E. Vincent, Multichannel Audio Source Separation With Deep Neural Networks, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.9, pp.1652-1664, 2016.
DOI : 10.1109/TASLP.2016.2580946
URL : https://hal.archives-ouvertes.fr/hal-01163369

J. Heymann, L. Drude, A. Chinaev, and R. Haeb-umbach, BLSTM supported GEV beamformer front-end for the 3RD CHiME challenge, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.2015-485
DOI : 10.1109/ASRU.2015.7404829

H. Erdogan, T. Hayashi, J. R. Hershey, T. Hori, C. Hori et al., Multi-channel speech recognition: LSTMs all the way through, Workshop on Speech Processing in Everday Environments, 2016.

H. Cox, R. M. Zeskind, and M. Owen, Robust adaptive beamforming, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.35, issue.10, pp.1365-1376, 1987.
DOI : 10.1109/TASSP.1987.1165054

E. Warsitz and R. Haeb-umbach, Blind Acoustic Beamforming Based on Generalized Eigenvalue Decomposition, Speech, 495 and Language Processing, pp.1529-1539, 2007.
DOI : 10.1109/TASL.2007.898454

S. Doclo and M. Moonen, GSVD-based optimal filtering for single and multimicrophone speech enhancement, IEEE Transactions on Signal Processing, vol.50, issue.9, pp.2230-2244, 2002.
DOI : 10.1109/TSP.2002.801937
URL : ftp://ftp.esat.kuleuven.ac.be/pub/SISTA/doclo/reports/01-30.ps.gz

A. Spriet, M. Moonen, and J. Wouters, Spatially pre-processed speech distortion weighted multi-channel Wiener filtering for noise reduction, Signal Processing, vol.84, issue.12, pp.2367-2387, 2004.
DOI : 10.1016/j.sigpro.2004.07.028
URL : http://www.kecl.ntt.co.jp/icl/signal/iwaenc03/cdrom/data/0036.pdf

S. Doclo, A. Spriet, J. Wouters, and M. Moonen, Frequency-domain criterion for the speech distortion weighted multichannel Wiener filter for robust noise reduction, Speech Communication, vol.49, issue.7-8, pp.636-656, 2007.
DOI : 10.1016/j.specom.2007.02.001
URL : https://hal.archives-ouvertes.fr/hal-00499178

]. R. Serizel, M. Moonen, B. Van-dijk, and J. Wouters, Low-rank Approximation Based Multichannel Wiener Filter Algorithms for Noise Reduction with Application in Cochlear Implants, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.4, pp.505-785, 2014.
DOI : 10.1109/TASLP.2014.2304240
URL : https://hal.archives-ouvertes.fr/hal-01390918

J. R. Jensen, J. Benesty, and M. G. Christensen, Noise Reduction with Optimal Variable Span Linear Filters, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.4, pp.631-644, 2016.
DOI : 10.1109/TASLP.2015.2505416

J. Benesty, J. Chen, and Y. Huang, Noncausal (frequency-domain) optimal filters, Microphone Array Signal Processing, pp.115-137, 2008.

M. Souden, J. Benesty, and S. Affes, On Optimal Frequency-Domain Multichannel Linear Filtering for Noise Reduction, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.2, pp.260-276, 2010.
DOI : 10.1109/TASL.2009.2025790

J. Benesty, M. Souden, and J. Chen, A perspective on multichannel noise reduction in the time domain, Applied Acoustics, vol.74, issue.3, pp.343-355, 2013.
DOI : 10.1016/j.apacoust.2012.08.002

S. Gannot, E. Vincent, S. Markovich-golan, and A. Ozerov, A Consolidated Perspective on Multimicrophone Speech Enhancement and Source Separation, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.25, issue.4, pp.692-730, 2017.
DOI : 10.1109/TASLP.2016.2647702
URL : https://hal.archives-ouvertes.fr/hal-01414179

E. Vincent, S. Watanabe, A. A. Nugraha, J. Barker, and R. Marxer, An analysis of environment, microphone and data simulation mismatches in robust 525 speech recognition, 2016.

S. Braun, K. Kowalczyk, and E. A. Habets, Residual noise control using a parametric multichannel Wiener filter, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.2015-360
DOI : 10.1109/ICASSP.2015.7177991

S. Markovich, S. Gannot, and I. Cohen, Multichannel Eigenspace Beamforming in a Reverberant Noisy Environment With Multiple Interfering Speech Signals, IEEE Transactions on Audio, Speech, and Language Processing, vol.17, issue.6, pp.1071-1086, 2009.
DOI : 10.1109/TASL.2009.2016395

D. Kingma and J. Ba, Adam: A method for stochastic optimization, arXiv preprint arXiv:1412.6980, 2014. 535 [33] S. Ioffe, C. Szegedy, Batch normalization: Accelerating deep network training by reducing internal covariate shift, 2015.

J. Barker, R. Marxer, E. Vincent, and S. Watanabe, The third ???CHiME??? speech separation and recognition challenge: Analysis and outcomes, Computer Speech & Language, vol.46, 2016.
DOI : 10.1016/j.csl.2016.10.005
URL : https://hal.archives-ouvertes.fr/hal-01382108

B. Cornelis, M. Moonen, and J. Wouters, Performance Analysis of Multichannel Wiener Filter-Based Noise Reduction in Hearing Aids Under Second Order Statistics Estimation Errors, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.5, pp.1368-1381, 2011.
DOI : 10.1109/TASL.2010.2090519

C. H. Taal, R. C. Hendriks, R. Heusdens, and J. Jensen, An evaluation of objective measures for intelligibility prediction of time-frequency weighted noisy speech, The Journal of the Acoustical Society of America, vol.130, issue.5, pp.545-3013, 2011.
DOI : 10.1121/1.3641373