O. Yilmaz and S. Rickard, Blind separation of speech mixtures via time-frequency masking, IEEE Transactions on Signal Processing, vol.52, issue.7, pp.1830-1847, 2014.
DOI : 10.1109/tsp.2004.828896

M. I. Mandel, R. J. Weiss, and D. P. Ellis, Model-based expectation-maximization source separation and localization, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.2, pp.382-394, 2010.
DOI : 10.1109/tasl.2009.2029711

URL : https://academiccommons.columbia.edu/doi/10.7916/D8CJ8PXC/download

S. Winter, W. Kellermann, H. Sawada, and S. Makino, MAPbased underdetermined blind source separation of convolutive mixtures by hierarchical clustering and 1-norm minimization, EURASIP Journal on Applied Signal Processing, issue.1, pp.81-81, 2007.

A. Ozerov and C. Févotte, Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.550-563, 2010.
DOI : 10.1109/tasl.2009.2031510

Y. Avargel and I. Cohen, On multiplicative transfer function approximation in the short-time Fourier transform domain, IEEE Signal Processing Letters, vol.14, issue.5, pp.337-340, 2007.

S. Gannot, D. Burshtein, and E. Weinstein, Signal enhancement using beamforming and nonstationarity with applications to speech, IEEE Transactions on Signal Processing, vol.49, issue.8, pp.1614-1626, 2001.
DOI : 10.1109/78.934132

X. Li, L. Girin, R. Horaud, and S. Gannot, Estimation of relative transfer function in the presence of stationary noise based on segmental power spectral density matrix subtraction, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.320-324, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01119186

H. L. Van-trees, Detection, estimation, and modulation theory, 2004.

S. Gannot, E. Vincent, S. Markovich-golan, and A. Ozerov, A consolidated perspective on multimicrophone speech enhancement and source separation, Speech, and Language Processing, vol.25, pp.692-730, 2017.
DOI : 10.1109/taslp.2016.2647702

URL : https://hal.archives-ouvertes.fr/hal-01414179

N. Duong, E. Vincent, and R. Gribonval, Under-determined reverberant audio source separation using a full-rank spatial covariance model, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1830-1840, 2010.
DOI : 10.1109/tasl.2010.2050716

URL : https://hal.archives-ouvertes.fr/inria-00435807

M. Kowalski, E. Vincent, and R. Gribonval, Beyond the narrowband approximation: Wideband convex methods for underdetermined reverberant audio source separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1818-1829, 2010.
DOI : 10.1109/tasl.2010.2050089

URL : https://hal.archives-ouvertes.fr/hal-00435897

S. Arberet, P. Vandergheynst, J. Carrillo, R. E. Thiran, and Y. Wiaux, Sparse reverberant audio source separation via reweighted analysis, IEEE Transactions on Audio, Speech, and Language Processing, issue.7, pp.1391-1402, 2013.
DOI : 10.1109/tasl.2013.2250962

URL : https://infoscience.epfl.ch/record/180378/files/tech-rep-SSCS.pdf

S. Leglaive, R. Badeau, and G. Richard, Multichannel audio source separation: variational inference of time-frequency sources from time-domain observations, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.26-30, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01416347

S. Leglaive, R. Badeau, and G. Richard, Separating timefrequency sources from time-domain convolutive mixtures using non-negative matrix factorization, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA, pp.264-268, 2017.
DOI : 10.1109/waspaa.2017.8170036

URL : https://hal.archives-ouvertes.fr/hal-01548469

Y. Avargel and I. Cohen, System identification in the short-time Fourier transform domain with crossband filtering, IEEE Transactions on Audio, Speech, and Language Processing, vol.15, pp.1305-319, 2007.

R. Talmon, I. Cohen, and S. Gannot, Relative transfer function identification using convolutive transfer function approximation, IEEE Transactions on Audio, Speech, and Language Processing, vol.17, issue.4, pp.546-555, 2009.
DOI : 10.1109/tasl.2008.2009576

X. Li, L. Girin, and R. Horaud, Audio source separation based on convolutive transfer function and frequency-domain lasso optimization', IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.541-545, 2017.
DOI : 10.1109/icassp.2017.7952214

URL : https://hal.archives-ouvertes.fr/hal-01430754

X. Li, L. Girin, S. Gannot, and R. Horaud, Multichannel speech separation and enhancement using the convolutive transfer function', Speech, and Language Processing, 2018.
DOI : 10.1109/taslp.2019.2892412

URL : https://hal.archives-ouvertes.fr/hal-01799809

R. Talmon, I. Cohen, and S. Gannot, Convolutive transfer function generalized sidelobe canceler, vol.17, pp.1420-1434, 2009.
DOI : 10.1109/tasl.2009.2020891

B. Schwartz, S. Gannot, and E. A. Habets, Online speech dereverberation using kalman filter and EM algorithm, IEEE/ACM Transactions on Audio, Speech and Language Processing, vol.23, issue.2, pp.394-406, 2015.
DOI : 10.1109/taslp.2014.2372342

R. Badeau and M. D. Plumbley, Multichannel high-resolution NMF for modeling convolutive mixtures of non-stationary signals in the time-frequency domain, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.11, pp.1670-1680, 2014.

T. Higuchi and H. Kameoka, Joint audio source separation and dereverberation based on multichannel factorial hidden Markov model, IEEE International Workshop on Machine Learning for Signal Processing, pp.1-6, 2014.
DOI : 10.1109/mlsp.2014.6958927

X. Li, S. Gannot, L. Girin, and R. Horaud, Multichannel identification and nonnegative equalization for dereverberation and noise reduction based on convolutive transfer function, Speech, and Language Processing, vol.26, pp.1755-1768, 2018.
DOI : 10.1109/taslp.2018.2839362

URL : https://hal.archives-ouvertes.fr/hal-01645749

X. Li, L. Girin, and R. Horaud, An EM algorithm for audio source separation based on the convolutive transfer function, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA, pp.56-60, 2017.
DOI : 10.1109/waspaa.2017.8169994

URL : https://hal.archives-ouvertes.fr/hal-01568818

W. G. Gardner and K. D. Martin, HRTF measurements of a KEMAR dummy-head microphone, The Journal of the Acoustical Society of America, vol.97, issue.6, pp.3907-3908, 1995.
DOI : 10.1121/1.412407

D. Campbell, The roomsim user guide (v3.3, 2004.

J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett et al., Getting started with the DARPA TIMIT CD-ROM: An acoustic phonetic continuous speech database, p.107, 1988.

D. R. Morgan, J. Benesty, and M. M. Sondhi, On the evaluation of estimated impulse responses, IEEE Signal processing letters, issue.5, pp.174-176, 1998.

E. Vincent, R. Gribonval, and C. Févotte, Performance measurement in blind audio source separation, IEEE transactions on audio, speech, and language processing, vol.14, pp.1462-1469, 2006.
DOI : 10.1109/tsa.2005.858005

URL : https://hal.archives-ouvertes.fr/inria-00544230

X. Li, L. Girin, R. Horaud, and S. Gannot, Multiple-speaker localization based on direct-path features and likelihood maximization with spatial sparsity regularization, Speech, and Language Processing, vol.25, pp.1997-2012, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01413417