D. D. Lee and H. S. Seung, Learning the parts of objects with nonnegative matrix factorization, Nature, vol.401, pp.788-791, 1999.

T. Virtanen, Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.3, pp.1066-1074, 2007.
DOI : 10.1109/TASL.2006.885253

M. N. Schmidt and R. K. Olsson, Single-channel speech separation using sparse non-negative matrix factorization, Spoken Language Proceesing, ISCA International Conference on (INTERSPEECH), 2006.

L. , L. Magoarou, A. Ozerov, and N. Q. Duong, Text-informed audio source separation. example-based approach using non-negative matrix partial cofactorization, Journal of Signal Processing Systems, vol.79, issue.2, pp.117-131, 2015.
URL : https://hal.archives-ouvertes.fr/hal-00870066

C. Févotte, N. Bertin, and J. Durrieu, Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis, Neural Computation, vol.14, issue.3, pp.793-830, 2009.
DOI : 10.1016/j.sigpro.2007.01.024

D. Badawy, N. Q. Duong, and A. Ozerov, On-the-Fly Audio Source Separation???A Novel User-Friendly Framework, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.25, issue.2, pp.261-272, 2017.
DOI : 10.1109/TASLP.2016.2632528

URL : https://hal.archives-ouvertes.fr/hal-01400990

E. Vincent, N. Bertin, and R. Badeau, Adaptive Harmonic Spectral Decomposition for Multiple Pitch Estimation, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.528-537, 2010.
DOI : 10.1109/TASL.2009.2034186

URL : https://hal.archives-ouvertes.fr/inria-00544094

A. Ozerov, E. Vincent, and F. Bimbot, A General Flexible Framework for the Handling of Prior Information in Audio Source Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.4, pp.1118-1133, 2012.
DOI : 10.1109/TASL.2011.2172425

URL : https://hal.archives-ouvertes.fr/inria-00536917

N. Mohammadiha, P. Smaragdis, and A. Leijon, Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization, IEEE Transactions on Audio, Speech, and Language Processing, vol.21, issue.10, pp.2140-2151, 2013.
DOI : 10.1109/TASL.2013.2270369

URL : http://kth.diva-portal.org/smash/get/diva2:634165/FULLTEXT02

D. Fitzgerald, M. Cranitch, and E. Coyle, Non-negative tensor factorisation for sound source separation, IEE Irish Signals and Systems Conference 2005, 2005.
DOI : 10.1049/cp:20050279

A. Ozerov and C. Févotte, Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.550-563, 2010.
DOI : 10.1109/TASL.2009.2031510

H. Sawada, R. Mukai, S. Araki, and S. Makino, A Robust and Precise Method for Solving the Permutation Problem of Frequency-Domain Blind Source Separation, IEEE Transactions on Speech and Audio Processing, vol.12, issue.5, pp.530-538, 2004.
DOI : 10.1109/TSA.2004.832994

M. I. Mandel, D. P. Ellis, and T. Jebara, An EM algorithm for localizing multiple sound sources in reverberant environments, NIPS, 2006.

A. Ozerov, C. Févotte, R. Blouet, and J. Durrieu, Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.257-260, 2011.
DOI : 10.1109/ICASSP.2011.5946389

URL : https://hal.archives-ouvertes.fr/inria-00564851

H. Sawada, H. Kameoka, S. Araki, and N. Ueda, Multichannel Extensions of Non-Negative Matrix Factorization With Complex-Valued Data, IEEE Transactions on Audio, Speech, and Language Processing, vol.21, issue.5, pp.971-982, 2013.
DOI : 10.1109/TASL.2013.2239990

J. Nikunen and T. Virtanen, Direction of Arrival Based Spatial Covariance Model for Blind Sound Source Separation, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.3, pp.727-739, 2014.
DOI : 10.1109/TASLP.2014.2303576

N. Q. Duong, E. Vincent, and R. Gribonval, Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1830-1840, 2010.
DOI : 10.1109/TASL.2010.2050716

URL : https://hal.archives-ouvertes.fr/inria-00435807

C. Févotte and J. Cardoso, Maximum likelihood approach for blind audio source separation using time-frequency Gaussian source models, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005., pp.78-81, 2005.
DOI : 10.1109/ASPAA.2005.1540173

E. Vincent, S. Arberet, and R. Gribonval, Underdetermined Instantaneous Audio Source Separation via Local Gaussian Modeling, International Conference on Independent Component Analysis and Signal Separation, pp.775-782, 2009.
DOI : 10.1109/TSP.2004.828896

URL : https://hal.archives-ouvertes.fr/hal-00482223

H. Kameoka, T. Yoshioka, M. Hamamura, J. L. Roux, and K. Kashino, Statistical Model of Speech Signals Based on Composite Autoregressive System with Application to Blind Source Separation, International Conference on Latent Variable Analysis and Signal Separation, pp.245-253, 2010.
DOI : 10.1007/978-3-642-15995-4_31

T. Higuchi, H. Takeda, T. Nakamura, and H. Kameoka, A unified approach for underdetermined blind signal separation and source activity detection by multichannel factorial hidden markov models, INTERSPEECH, pp.850-854, 2014.

J. Breebaart, S. Van-de-par, A. Kohlrausch, and E. Schuijers, Parametric Coding of Stereo Audio, EURASIP Journal on Advances in Signal Processing, vol.2005, issue.9, pp.1305-1322, 2005.
DOI : 10.1155/ASP.2005.1305

URL : https://doi.org/10.1155/asp.2005.1305

M. I. Mandel, R. J. Weiss, and D. P. Ellis, Model-Based Expectation-Maximization Source Separation and Localization, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.2, pp.382-394, 2010.
DOI : 10.1109/TASL.2009.2029711

URL : http://www.ee.columbia.edu/%7Eronw/pubs/taslp09-messl.pdf

E. Vincent and X. Rodet, Underdetermined Source Separation with Structured Source Priors, International Conference on Independent Component Analysis and Signal Separation, pp.327-334, 2004.
DOI : 10.1007/978-3-540-30110-3_42

URL : https://hal.archives-ouvertes.fr/inria-00544694

E. Vincent, Musical source separation using time-frequency source priors, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.1, pp.91-98, 2006.
DOI : 10.1109/TSA.2005.860342

URL : https://hal.archives-ouvertes.fr/inria-00544269

S. Arberet, A. Ozerov, N. Q. Duong, E. Vincent, R. Gribonval et al., Nonnegative matrix factorization and spatial covariance model for under-determined reverberant audio source separation, 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010), pp.1-4, 2010.
DOI : 10.1109/ISSPA.2010.5605570

URL : https://hal.archives-ouvertes.fr/inria-00541436

T. Virtanen and A. Klapuri, Analysis of polyphonic audio using sourcefilter model and non-negative matrix factorization, Advances in models for acoustic processing, neural information processing systems workshop. Citeseer, 2006.

N. Souvirà-a-labastie, A. Olivero, E. Vincent, and F. Bimbot, Multi-Channel Audio Source Separation Using Multiple Deformed References, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, issue.11, pp.1775-1787, 2015.
DOI : 10.1109/TASLP.2015.2450494

V. Y. Tan and C. Févotte, Automatic Relevance Determination in Nonnegative Matrix Factorization with the /spl beta/-Divergence, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.7, pp.1592-1605, 2013.
DOI : 10.1109/TPAMI.2012.240

R. Bro, Parafac. tutorial and applications Chemometrics and intelligent laboratory systems, pp.149-171, 1997.

L. Parra and C. Spence, Convolutive blind separation of non-stationary sources, IEEE transactions on Speech and Audio Processing, pp.320-327, 2000.
DOI : 10.1109/89.841214

S. Gannot, E. Vincent, S. Markovich-golan, and A. Ozerov, A Consolidated Perspective on Multimicrophone Speech Enhancement and Source Separation, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.25, issue.4, pp.692-730, 2017.
DOI : 10.1109/TASLP.2016.2647702

URL : https://hal.archives-ouvertes.fr/hal-01414179

N. Q. Duong, E. Vincent, and R. Gribonval, Spatial location priors for Gaussian model based reverberant audio source separation, EURASIP Journal on Advances in Signal Processing, vol.92, issue.4, p.149, 2013.
DOI : 10.1007/978-3-642-15995-4_8

URL : https://hal.archives-ouvertes.fr/hal-00727781

R. Badeau and M. D. Plumbley, Multichannel High-Resolution NMF for Modeling Convolutive Mixtures of Non-Stationary Signals in the Time-Frequency Domain, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.11, pp.1670-1680, 2014.
DOI : 10.1109/TASLP.2014.2341920

D. Kounades-bastian, L. Girin, X. Alameda-pineda, S. Gannot, and R. Horaud, An inverse-gamma source variance prior with factorized parameterization for audio source separation, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.136-140, 2016.
DOI : 10.1109/ICASSP.2016.7471652

URL : https://hal.archives-ouvertes.fr/hal-01253169

N. Q. Duong, H. Tachibana, E. Vincent, N. Ono, R. Gribonval et al., Multichannel harmonic and percussive component separation by joint modeling of spatial and spectral continuity, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.205-208, 2011.
DOI : 10.1109/ICASSP.2011.5946376

URL : https://hal.archives-ouvertes.fr/inria-00557145

T. Higuchi, N. Takamune, T. Nakamura, and H. Kameoka, Underdetermined blind separation and tracking of moving sources based on DOA-HMM, Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, pp.3191-3195, 2014.

D. Kounades-bastian, L. Girin, X. Alameda-pineda, S. Gannot, and R. Horaud, A Variational EM Algorithm for the Separation of Time-Varying Convolutive Audio Mixtures, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.8, pp.1408-1423, 2016.
DOI : 10.1109/TASLP.2016.2554286

URL : https://hal.archives-ouvertes.fr/hal-01301762

M. Togami, Online speech source separation based on maximum likelihood of local Gaussian modeling, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.213-216, 2011.
DOI : 10.1109/ICASSP.2011.5946378

L. S. Simon and E. Vincent, A General Framework for Online Audio Source Separation, International conference on Latent Variable Analysis and Signal Separation, pp.397-404, 2012.
DOI : 10.1007/978-3-662-04619-7

URL : https://hal.archives-ouvertes.fr/hal-00655398

N. Q. Duong, E. Vincent, and R. Gribonval, Under-determined reverberant audio source separation using local observed covariance and auditorymotivated time-frequency representation, International Conference on Latent Variable Analysis and Signal Separation, pp.73-80, 2010.
DOI : 10.1007/978-3-642-15995-4_10

URL : https://hal.archives-ouvertes.fr/inria-00541868

K. Adilo?-glu and E. Vincent, Variational Bayesian Inference for Source Separation and Robust Feature Extraction, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.10, pp.1746-1758, 2016.
DOI : 10.1109/TASLP.2016.2583794

A. P. Dempster, N. M. Laird, and D. B. Rubin, Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.39, pp.1-38, 1977.

J. Thiemann and E. Vincent, A fast EM algorithm for Gaussian model-based source separation, Signal Processing Conference (EUSIPCO), 2013 Proceedings of the 21st European, pp.1-5, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00840366

D. R. Hunter and K. Lange, A Tutorial on MM Algorithms, The American Statistician, vol.58, issue.1, pp.30-37, 2004.
DOI : 10.1198/0003130042836