M. Abramowitz and &. I. Stegun, Handbook of Mathematical Functions, American Journal of Physics, vol.34, issue.2, 1965.
DOI : 10.1119/1.1972842

&. S. Addison and . Roberts, Blind source separation with non-stationary mixing using wavelets, Int. Conf. Independent Component Analysis and Blind Source Separation (ICA), 2006.

R. Aichner, H. Buchner, S. Araki, and &. S. Makino, On-line timedomain blind source separation of nonstationary convolved signals, Int. Conf. Independent Component Analysis and Blind Source Separation (ICA), 2003.

]. J. Allen-79, &. D. Allen, and . Berkley, Image method for efficiently simulating small?room acoustics, The Journal of the Acoustical Society of America, vol.65, issue.4, pp.943-950, 1979.
DOI : 10.1121/1.382599

]. J. Anemüller-99, &. T. Anemüller, and . Gramss, On-line blind separation of moving sound sources, Int. Conf. Independent Component Analysis and Blind Source Separation (ICA), 1999.

X. A. Miro, S. Bozonnet, N. Evans, C. Fredouille, G. Friedland et al., Speaker Diarization: A Review of Recent Research, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.2, pp.356-371, 2012.
DOI : 10.1109/TASL.2011.2125954

S. Araki, R. Mukai, S. Makino, T. Nishikawa, and &. H. Saruwatari, The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech, IEEE Transactions on Speech and Audio Processing, vol.11, issue.2, pp.109-116, 2003.
DOI : 10.1109/TSA.2003.809193

S. Araki, H. Sawada, R. Mukai, and &. S. Makino, Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors, Signal Processing, vol.87, issue.8, pp.1833-1847, 2007.
DOI : 10.1016/j.sigpro.2007.02.003

S. Arberet, A. Ozerov, N. Q. Duong, E. Vincent, R. Gribonval et al., Nonnegative matrix factorization and spatial covariance model for under-determined reverberant audio source separation, 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010), 2010.
DOI : 10.1109/ISSPA.2010.5605570
URL : https://hal.archives-ouvertes.fr/inria-00541436

L. Benaroya, L. Donagh, F. Bimbot, and &. R. Gribonval, Non negative sparse representation for Wiener based source separation with a single sensor, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)., 2003.
DOI : 10.1109/ICASSP.2003.1201756
URL : https://hal.archives-ouvertes.fr/inria-00574784

N. Bertin, R. Badeau, and &. E. Vincent, Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.538-549, 2010.
DOI : 10.1109/TASL.2010.2041381
URL : https://hal.archives-ouvertes.fr/inria-00557088

]. C. Bilen, A. Ozerov, and &. P. Pérez, Automatic allocation of NTF components for user-guided audio source separation, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016.
DOI : 10.1109/ICASSP.2016.7471722
URL : https://hal.archives-ouvertes.fr/hal-01259430

C. E. Cherry, Some Experiments on the Recognition of Speech, with One and with Two Ears, The Journal of the Acoustical Society of America, vol.25, issue.5, pp.975-979, 1953.
DOI : 10.1121/1.1907229

L. Drude, A. Chinaev, D. H. Tran-vu, and &. R. Haeb-umbach, Source counting in speech mixtures using a variational EM approach for complex WATSON mixture models, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014.
DOI : 10.1109/ICASSP.2014.6854924

]. N. Duong, E. Vincent, and &. R. Gribonval, Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1830-1840, 2010.
DOI : 10.1109/TASL.2010.2050716
URL : https://hal.archives-ouvertes.fr/inria-00541865

&. D. Ephraim and . Malah, Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.32, issue.6, pp.443-445, 1984.
DOI : 10.1109/TASSP.1984.1164453

]. G. Evangelidis, D. Kounades-bastian, R. Horaud, and &. E. Psarakis, A Generative Model for the Joint Registration of Multiple Point Sets, European Conf. Computer Vision (ECCV), 2014.
DOI : 10.1007/978-3-319-10584-0_8
URL : https://hal.archives-ouvertes.fr/hal-01019661

C. Févotte, N. Bertin, and &. Durrieu, Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis, Neural Computation, vol.14, issue.3, pp.793-830, 2009.
DOI : 10.1016/j.sigpro.2007.01.024

S. Gannot, D. Burshtein, and &. E. Weinstein, Signal enhancement using beamforming and nonstationarity with applications to speech, IEEE Transactions on Signal Processing, vol.49, issue.8, pp.1614-1626, 2001.
DOI : 10.1109/78.934132
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.455.2627

S. Gannot and &. M. Moonen, On the application of the unscented Kalman filter to speech processing, IEEE Int. Workshop Acoustic Echo and Noise Control (IWAENC), 2003.

S. Gannot, E. Vincent, S. Markovich-golan, and &. A. Ozerov, A consolidated perspective on multi-microphone speech enhancement and source separation, IEEE Trans. Audio, Speech, Lang. Process, 2017.
DOI : 10.1109/taslp.2016.2647702
URL : https://hal.archives-ouvertes.fr/hal-01414179

L. Girin and &. R. Badeau, On the Use of Latent Mixing Filters in Audio Source Separation of Moving Sound Sources, 13th Int. Conf. on Latent Variable Analysis and Signal Separation (LVA/ICA) IEEE Workshop Applicat. Signal Process. to Audio and Acoust. (WASPAA), 2015.

]. D. Kounades-bastian-16a, L. Kounades-bastian, X. Girin, S. Alameda-pineda, &. R. Gannot et al., An inverse-gamma source variance prior with factorized parametrization for audio source separation, IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2016.

]. D. Kounades-bastian-16b, L. Kounades-bastian, X. Girin, S. Alameda-pineda, &. R. Gannot et al., A Variational EM Algorithm for the Separation of Time-Varying Convolutive Audio Mixtures, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.8, pp.1408-1423, 2016.
DOI : 10.1109/TASLP.2016.2554286

L. Bastian, X. Girin, S. Alameda-pineda, &. R. Gannot, and . Horaud, An EM Algorithm for Joint Source Separation and diarisation of Multichannel Convolutive Speech Mixtures, IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2017.
URL : https://hal.archives-ouvertes.fr/hal-01430761

M. Kowalski, E. Vincent, and &. R. Gribonval, Beyond the Narrowband Approximation: Wideband Convex Methods for Under-Determined Reverberant Audio Source Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1818-1829, 2010.
DOI : 10.1109/TASL.2010.2050089
URL : https://hal.archives-ouvertes.fr/hal-00435897

D. Lee and &. H. Seung, Algorithms for non-negative matrix factorization, Advances in Neural Information Process. Systems, pp.556-562, 2001.

S. Leglaive, R. Badeau, and &. G. Richard, Multichannel Audio Source Separation With Probabilistic Reverberation Priors, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.12, 2016.
DOI : 10.1109/TASLP.2016.2614140
URL : https://hal.archives-ouvertes.fr/hal-01370051

B. Loesch and &. B. Yang, Online blind source separation based on time-frequency sparseness, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, 2009.
DOI : 10.1109/ICASSP.2009.4959534

M. Mandel, R. J. Weiss, and D. P. , Elliset al. Model-based expectation-maximization source separation and localization

]. S. Markovich-golan-10, S. Markovich-golan, &. I. Gannot, and . Cohen, Subspace tracking of multiple sources and its application to speakers extraction, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 2010.
DOI : 10.1109/ICASSP.2010.5496044

]. T. May, S. Van-de-par, and &. A. Kohlrausch, A Probabilistic Model for Robust Localization Based on a Binaural Auditory Front-End, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.1, pp.1-13, 2011.
DOI : 10.1109/TASL.2010.2042128

R. Mukai, H. Sawada, S. Araki, and &. S. Makino, Robust real-time blind source separation for moving speakers in a room, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)., 2003.
DOI : 10.1109/ICASSP.2003.1200008

]. K. Nakadai, H. Nakajima, Y. Hasegawa, and &. H. Tsujino, Sound source separation of moving speakers for robot audition, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, 2009.
DOI : 10.1109/ICASSP.2009.4960426

F. Neeser and &. J. Massey, Proper complex random processes with applications to information theory, IEEE Transactions on Information Theory, vol.39, issue.4, pp.1293-1302, 1993.
DOI : 10.1109/18.243446
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.408.1107

A. Ozerov, C. Févotte, R. Blouet, and &. Durrieu, Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011.
DOI : 10.1109/ICASSP.2011.5946389
URL : https://hal.archives-ouvertes.fr/inria-00564851

A. Ozerov, E. Vincent, and &. F. Bimbot, A General Flexible Framework for the Handling of Prior Information in Audio Source Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.4, pp.1118-1133, 2012.
DOI : 10.1109/TASL.2011.2172425
URL : https://hal.archives-ouvertes.fr/hal-00626962

L. Parra and &. C. Spence, Convolutive blind separation of non-stationary sources, IEEE Transactions on Speech and Audio Processing, vol.8, issue.3, pp.320-327, 2000.
DOI : 10.1109/89.841214

K. B. Petersen and &. S. Pedersen, The matrix cookbook. Version, 2012.

R. E. Prieto and &. P. Jinachitra, Blind source separation for timevariant mixing systems using piecewise linear approximations
DOI : 10.1109/icassp.2005.1416300

H. Sawada, R. Mukai, S. Araki, and &. S. Makino, A Robust and Precise Method for Solving the Permutation Problem of Frequency-Domain Blind Source Separation, IEEE Transactions on Speech and Audio Processing, vol.12, issue.5, pp.530-538, 2004.
DOI : 10.1109/TSA.2004.832994

H. Sawada, S. Araki, R. Mukai, and &. S. Makino, Grouping Separated Frequency Components by Estimating Propagation Model Parameters in Frequency-Domain Blind Source Separation, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.5, pp.1592-1604, 2007.
DOI : 10.1109/TASL.2007.899218
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.453.4393

L. Simon and &. E. Vincent, A General Framework for Online Audio Source Separation, Int. Conf. Latent Variable Analysis and Signal Separation, 2012.
DOI : 10.1007/978-3-662-04619-7
URL : https://hal.archives-ouvertes.fr/hal-00655398

P. Smaragdis and &. J. Brown, Non-negative matrix factorization for polyphonic music transcription, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684), 2003.
DOI : 10.1109/ASPAA.2003.1285860
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.475.7518

&. A. Smidl and . Quinn, The Variational Bayes Method in Signal Process, 2006.

N. Sturmel, A. Liutkus, J. Pinel, L. Girin, S. Marchand et al., Linear mixing models for active listening of music productions in realistic studio conditions, Convention of the Audio Engineering Society (AES) 13] V. Y. F. Tan & C. Févotte. Automatic Relevance Determination in Nonnegative Matrix Factorization with the beta-Divergence
URL : https://hal.archives-ouvertes.fr/hal-00790783

J. Traa and &. P. Smaragdis, Multichannel Source Separation and Tracking With RANSAC and Directional Statistics, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.12, pp.2233-2243, 2014.
DOI : 10.1109/TASLP.2014.2365701

]. S. Tranter and &. D. Reynolds, An overview of automatic speaker diarization systems, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.5, pp.1557-1565, 2006.
DOI : 10.1109/TASL.2006.878256
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.471.935

D. Vijayasenan, F. Valente, and &. H. Bourlard, Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features, Speech Communication, vol.54, issue.1
DOI : 10.1016/j.specom.2011.07.001