Y. Avargel and I. Cohen, On Multiplicative Transfer Function Approximation in the Short-Time Fourier Transform Domain, IEEE Signal Processing Letters, vol.14, issue.5, pp.337-340, 2007.
DOI : 10.1109/LSP.2006.888292

E. Vincent, M. G. Jafari, S. A. Abdallah, M. D. Plumbley, and M. E. Davies, Probabilistic Modeling Paradigms for Audio Source Separation, Machine Audition: Principles, Algorithms and Systems, pp.162-185, 2010.
DOI : 10.4018/978-1-61520-919-4.ch007

URL : https://hal.archives-ouvertes.fr/inria-00544016

A. Hyvärinen, J. Karhunen, and E. Oja, Independent Component Analysis, 2001.

S. Winter, W. Kellermann, H. Sawada, and S. Makino, MAP-based underdetermined blind source separation of convolutive mixtures by hierarchical clustering and l1-norm minimization, EURASIP Journal on Advances in Signal Processing, p.24717, 2007.

M. Mandel, R. J. Weiss, and D. P. Ellis, Model-Based Expectation-Maximization Source Separation and Localization, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.2, pp.382-394, 2010.
DOI : 10.1109/TASL.2009.2029711

A. Liutkus, B. Badeau, and G. Richard, Gaussian Processes for Underdetermined Source Separation, IEEE Transactions on Signal Processing, vol.59, issue.7, pp.3155-3167, 2011.
DOI : 10.1109/TSP.2011.2119315

URL : https://hal.archives-ouvertes.fr/hal-00643951

D. Ephraim and Y. Malah, Speech enhancement using a minimum mean-square error log-spectral amplitude estimator, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.33, issue.2, pp.443-445, 1984.
DOI : 10.1109/TASSP.1985.1164550

L. Benaroya, L. Donagh, F. Bimbot, and R. Gribonval, Non negative sparse representation for Wiener based source separation with a single sensor, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)., pp.613-616, 2003.
DOI : 10.1109/ICASSP.2003.1201756

URL : https://hal.archives-ouvertes.fr/inria-00574784

L. Benaroya, F. Bimbot, and R. Gribonval, Audio source separation with a single sensor, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.1, pp.191-199, 2006.
DOI : 10.1109/TSA.2005.854110

URL : https://hal.archives-ouvertes.fr/inria-00544949

C. Févotte and J. Cardoso, Maximum likelihood approach for blind audio source separation using time-frequency Gaussian source models, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005., 2005.
DOI : 10.1109/ASPAA.2005.1540173

A. Ozerov and C. Févotte, Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.550-563, 2010.
DOI : 10.1109/TASL.2009.2031510

N. Duong, E. Vincent, and R. Gribonval, Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1830-1840, 2010.
DOI : 10.1109/TASL.2010.2050716

URL : https://hal.archives-ouvertes.fr/inria-00435807

A. Ozerov, E. Vincent, and F. Bimbot, A General Flexible Framework for the Handling of Prior Information in Audio Source Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.4, pp.1118-1133, 2012.
DOI : 10.1109/TASL.2011.2172425

URL : https://hal.archives-ouvertes.fr/inria-00536917

D. Lee and H. Seung, Learning the parts of objects by non-negative matrix factorization, Nature, vol.401, pp.788-791, 1999.

C. Févotte, N. Bertin, and J. Durrieu, Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis, Neural Computation, vol.14, issue.3, pp.793-830, 2009.
DOI : 10.1016/j.sigpro.2007.01.024

T. Yoshioka, T. Nakatani, M. Miyoshi, and H. G. Okuno, Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.1, pp.69-84, 2011.
DOI : 10.1109/TASL.2010.2045183

J. Anemüller and T. Gramss, On-line blind separation of moving sound sources, Proc. Int. Conf. Independent Component Analysis and Blind Source Separation (ICA), 1999.

A. Koutras, E. Dermatas, and G. Kokkinakis, Blind speech separation of moving speakers in real reverberant environments, Proc. IEEE Int, 2000.

K. E. Hild, I. , D. Erdogmus, and J. C. Principe, Blind source separation of time-varying, instantaneous mixtures using an on-line algorithm, Proc. IEEE Int, 2002.

R. Aichner, H. Buchner, S. Araki, and S. Makino, On-line time-domain blind source separation of nonstationary convolved signals, Proc. Int. Conf. Independent Component Analysis and Blind Source Separation (ICA), 2003.

R. E. Prieto and J. Pamornpol, Blind Source Separation for Time-Variant Mixing Systems Using Piecewise Linear Approximations, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., 2005.
DOI : 10.1109/ICASSP.2005.1416300

R. Mukai, H. Sawada, S. Araki, and S. Makino, Robust real-time blind source separation for moving speakers in a room, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)., 2003.
DOI : 10.1109/ICASSP.2003.1200008

W. Addison and S. Roberts, Blind source separation with non-stationary mixing using wavelets, Proc. Int. Conf. Independent Component Analysis and Blind Source Separation (ICA), 2006.

K. Nakadai, H. Nakajima, Y. Hasegawa, and H. Tsujino, Sound source separation of moving speakers for robot audition, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, 2009.
DOI : 10.1109/ICASSP.2009.4960426

S. Araki, H. Sawada, R. Mukai, and S. Makino, Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors, Signal Processing, vol.87, issue.8, pp.1833-1847, 2007.
DOI : 10.1016/j.sigpro.2007.02.003

B. Loesch and B. Yang, Online blind source separation based on timefrequency sparseness, Proc. IEEE Int, 2009.

L. Simon and E. Vincent, A General Framework for Online Audio Source Separation, Proc. Int. Conf. on Latent Variable Analysis and Signal Separation, 2012.
DOI : 10.1007/978-3-662-04619-7

URL : https://hal.archives-ouvertes.fr/hal-00655398

S. Markovich-golan, S. Gannot, and I. Cohen, Subspace tracking of multiple sources and its application to speakers extraction, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 2010.
DOI : 10.1109/ICASSP.2010.5496044

E. Weinstein, A. Oppenheim, M. Feder, and J. Buck, Iterative and sequential algorithms for multisensor signal enhancement, IEEE Transactions on Signal Processing, vol.42, issue.4, pp.846-859, 1994.
DOI : 10.1109/78.285648

T. Higuchi, N. Takamune, N. Tomohiko, and H. Kameoka, Underdetermined blind separation and tracking of moving sources based on DOA-HMM, Proc. IEEE Int, 2014.

C. Bishop, Pattern Recognition and Machine Learning, 2006.

S. Gannot and M. Moonen, On the application of the unscented Kalman filter to speech processing, Proc. IEEE Int. Workshop on Acoustic Echo and Noise Control (IWAENC), 2003.

D. Kounades-bastian, L. Girin, X. Alameda-pineda, S. Gannot, and R. Horaud, A variational EM algorithm for the separation of moving sound sources, 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2015.
DOI : 10.1109/WASPAA.2015.7336936

URL : https://hal.archives-ouvertes.fr/hal-01169764

F. Neeser and J. Massey, Proper complex random processes with applications to information theory, IEEE Transactions on Information Theory, vol.39, issue.4, pp.1293-1302, 1993.
DOI : 10.1109/18.243446

T. Virtanen, Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.3, pp.1066-1074, 2007.
DOI : 10.1109/TASL.2006.885253

N. Mohammadiha, P. Smaragdis, and A. Leijon, Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization, IEEE Transactions on Audio, Speech, and Language Processing, vol.21, issue.10, pp.2140-2151, 2013.
DOI : 10.1109/TASL.2013.2270369

L. Parra and C. Spence, Convolutive blind separation of non-stationary sources, IEEE Transactions on Speech and Audio Processing, vol.8, issue.3, pp.320-327, 2000.
DOI : 10.1109/89.841214

]. S. Gannot, D. Burshtein, and E. Weinstein, Signal enhancement using beamforming and nonstationarity with applications to speech, IEEE Transactions on Signal Processing, vol.49, issue.8, pp.1614-1626, 2001.
DOI : 10.1109/78.934132

G. Mclachlan and K. Thriyambakam, The EM algorithm and extensions, 1997.

V. Smidl and A. Quinn, The Variational Bayes Method in Signal Processing, 2006.

A. Hjorungnes and D. Gesbert, Complex-Valued Matrix Differentiation: Techniques and Key Results, IEEE Transactions on Signal Processing, vol.55, issue.6, pp.2740-2746, 2007.
DOI : 10.1109/TSP.2007.893762

N. Sturmel, A. Liutkus, J. Pinel, L. Girin, S. Marchand et al., Linear mixing models for active listening of music productions in realistic studio conditions, Proc. Convention of the Audio Engineering Society (AES), 2012.
URL : https://hal.archives-ouvertes.fr/hal-00790783

J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett et al., Timit acoustic-phonetic continuous speech corpus, linguistic Data Consortium, 1993.

C. Hummersone, R. Mason, and T. Brookes, A comparison of computational precedence models for source separation in reverberant environments, J. Audio Eng. Soc, vol.61, issue.78, pp.508-520, 2013.

E. Vincent, R. Gribonval, and C. Févotte, Performance measurement in blind audio source separation, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.4, pp.1462-1469, 2006.
DOI : 10.1109/TSA.2005.858005

URL : https://hal.archives-ouvertes.fr/inria-00544230

E. Vincent, H. Sawada, P. Bofill, S. Makino, and J. Rosca, First Stereo Audio Source Separation Evaluation Campaign: Data, Algorithms and Results, Proc. Int. Conf. on Independent Component Analysis and Signal Separation (ICA), pp.552-559, 2007.
DOI : 10.1007/978-3-540-74494-8_69

URL : https://hal.archives-ouvertes.fr/inria-00544199

Y. Dorfan and S. Gannot, Tree-Based Recursive Expectation-Maximization Algorithm for Localization of Acoustic Sources, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, issue.10, pp.1692-1703, 2015.
DOI : 10.1109/TASLP.2015.2444654

T. May, S. Van-de-par, and A. Kohlrausch, A Probabilistic Model for Robust Localization Based on a Binaural Auditory Front-End, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.1, pp.1-13, 2011.
DOI : 10.1109/TASL.2010.2042128

J. Woodruff and D. Wang, Binaural Localization of Multiple Sources in Reverberant and Noisy Environments, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.5, pp.1503-1512, 2012.
DOI : 10.1109/TASL.2012.2183869

J. Traa and P. Smaragdis, Multichannel Source Separation and Tracking With RANSAC and Directional Statistics, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.12, pp.2233-2243, 2014.
DOI : 10.1109/TASLP.2014.2365701

Y. Dorfan, D. Cherkassky, and S. Gannot, Speaker localization and separation using incremental distributed expectation-maximization, 2015 23rd European Signal Processing Conference (EUSIPCO), pp.1256-1260, 2015.
DOI : 10.1109/EUSIPCO.2015.7362585

J. B. Allen and D. A. Berkley, Image method for efficiently simulating small???room acoustics, The Journal of the Acoustical Society of America, vol.65, issue.4, pp.943-950, 1979.
DOI : 10.1121/1.382599

D. Kounades-bastian-received-the and M. , University of Patras (Greece) in 2013. He is currently working towards his Ph.D. in the Perception Team at INRIA (French Computer Science Research Institute) His research interests are machine learning and signal processing for audio scene analysis, Sc. degree in Computer Science from the Engineering School

X. Alameda-pineda-received-the and M. Sc, Mathematics, and in Telecommunications from Barcelona Tech; and in Computer Science from Grenoble-INP. He did his Ph.D. work in the Perception Team at INRIA (French Computer Science Research Institute), Grenoble, and received his Ph.D. degree in Mathematics and Computer Science from Université Joseph Fourier in 2013. He is currently a Post-Doctoral fellow at the University of Trento His research interests are multimodal machine learning and signal processing for scene analysis

S. Gannot, respectively, all in Electrical Engineering he held a research and teaching position at the Faculty of Electrical Engineering, Technion-Israel Institute of Technology, Haifa, Israel. Currently, he is a Full Professor at the Faculty of Engineering, Bar-Ilan University, Israel, where he is heading the Speech and Signal Processing laboratory and the Signal Processing Track. Prof. Gannot is the recipient of Bar-Ilan University outstanding lecturer award for 2010 and 2014. Prof. Gannot has served as an Associate Editor of the EURASIP Journal of Advances in Signal Processing in 2003-2012, and as an Editor of several special issues on Multi-microphone Speech Processing of the same journal. He has also served as a guest editor of ELSEVIER Speech Communication and Signal Processing journals. Prof. Gannot has served as an Associate Editor of IEEE Transactions on Speech, Audio and Language Processing in 2009-2013. Currently, he is a Senior Area Chair of the same journal. He also serves as a reviewer of many IEEE journals and conferences, Sc. degree (summa cum laude) from the Technion Israel Institute of Technology Prof. Gannot is a member of the Audio and Acoustic Signal Processing (AASP) technical committee of the IEEE since Currently, he serves as the committee vice-chair. He is also a member of the Technical and Steering committee of the International Workshop on Acoustic Signal Enhancement (IWAENC) since 2005 and was the general co-chair of IWAENC held at Tel-Aviv Prof. Gannot has served as the general co-chair of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) in, 1986.

. Prof, Gannot was selected (with colleagues) to present a tutorial sessions in ICASSP 2012, 2012.