L. , L. Magoarou, A. Ozerov, and N. Q. Duong, Textinformed audio source separation using nonnegative matrix partial co-factorization, Machine Learning for Signal Processing (MLSP), 2013 IEEE International Workshop on, pp.1-6, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00870066

E. Vincent, S. Araki, F. Theis, G. Nolte, P. Bofill et al., The signal separation evaluation campaign): Achievements and remaining challenges, Signal Processing, issue.8, pp.921928-1936, 2007.
URL : https://hal.archives-ouvertes.fr/inria-00579398

J. Ganseman, G. J. Mysore, J. S. Abel, and P. Scheunders, Source separation by score synthesis, Proc. Int. Computer Music Conference (ICMC), pp.462-465, 2010.

R. Hennequin, B. David, and R. Badeau, Score informed audio source separation using a parametric model of non-negative spectrogram, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.45-48, 2011.
DOI : 10.1109/ICASSP.2011.5946324

URL : https://hal.archives-ouvertes.fr/hal-00945294

U. Simsekli and A. T. , Score guided musical source separation using generalized coupled tensor factorization, Proc. 20th European Signal Processing Conference (EUSIPCO), pp.2639-2643, 2012.

J. Fritsch and M. D. Plumbley, Score informed audio source separation using constrained nonnegative matrix factorization and score synthesis, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.888-891, 2013.
DOI : 10.1109/ICASSP.2013.6637776

P. Smaragdis and G. J. Mysore, Separation by “humming”: User-guided sound extraction from monophonic mixtures, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp.69-72, 2009.
DOI : 10.1109/ASPAA.2009.5346542

D. Fitzgerald, User assisted source separation using nonnegative matrix factorisation, 22nd IET Irish Signals and Systems Conference, 2011.

J. L. Durrieu and J. P. Thiran, Musical Audio Source Separation Based on User-Selected F0 Track, Proc. Int. Conf. on Latent Variable Analysis and Signal Separation (LVA/ICA), pp.438-445, 2012.
DOI : 10.1109/TSA.2005.860342

A. Ozerov, C. Févotte, R. Blouet, and J. Durrieu, Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI : 10.1109/ICASSP.2011.5946389

URL : https://hal.archives-ouvertes.fr/inria-00564851

A. Lefèvre, F. Bach, and C. Févotte, Semi-supervised NMF with time-frequency annotations for single-channel source separation, Proc. Int. Symposium on Music Information Retrieval (ISMIR), pp.115-120, 2012.

N. J. Bryan and G. J. Mysore, Interactive user-feedback for sound source separation, International Conference on Intelligent User Interfaces (IUI), 2013.

Q. K. Duong, N. , A. Ozerov, L. Chevallier, and J. Sirot, An interactive audio source separation framework based on non-negative matrix factorization, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI : 10.1109/ICASSP.2014.6853861

URL : https://hal.archives-ouvertes.fr/hal-00960717

S. T. Roweis, One microphone source separation, Advances in Neural Information Processing Systems 13, pp.793-799, 2000.

W. Wang, D. Cosker, Y. Hicks, S. Sanei, and J. A. Chambers, Video Assisted Speech Source Separation, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., pp.425-428, 2005.
DOI : 10.1109/ICASSP.2005.1416331

G. J. Mysore and P. Smaragdis, A Non-negative Approach to Language Informed Speech Separation, Proc. Int. Conf. on Latent Variable Analysis and Signal Separation (LVA / ICA), pp.356-363, 2012.
DOI : 10.1109/TSA.2005.858005

M. Kim, J. Yoo, K. Kang, and S. Choi, Nonnegative Matrix Partial Co-Factorization for Spectral and Temporal Drum Source Separation, IEEE Journal of Selected Topics in Signal Processing, vol.5, issue.6, pp.1192-1204, 2011.
DOI : 10.1109/JSTSP.2011.2158803

T. Virtanen and A. Klapuri, Analysis of polyphonic audio using source-filter model and non-negative matrix factorization, Advances in Models for Acoustic Processing, Neural Information Processing Systems Workshop, 2006.

J. L. Durrieu, G. Richard, B. David, and C. Févotte, Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.564-575, 2010.
DOI : 10.1109/TASL.2010.2041114

A. Ozerov, E. Vincent, and F. Bimbot, A General Flexible Framework for the Handling of Prior Information in Audio Source Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.4, pp.1118-1133, 2012.
DOI : 10.1109/TASL.2011.2172425

URL : https://hal.archives-ouvertes.fr/inria-00536917

N. Q. Duong, E. Vincent, and R. Gribonval, Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1830-1840, 2010.
DOI : 10.1109/TASL.2010.2050716

URL : https://hal.archives-ouvertes.fr/inria-00435807

N. Ono, Z. Koldovsky, S. Miyabe, and N. Ito, The 2013 Signal Separation Evaluation Campaign, 2013 IEEE International Workshop on Machine Learning for Signal Processing (MLSP), pp.1-6, 2013.
DOI : 10.1109/MLSP.2013.6661988

C. Févotte, N. Bertin, and J. Durrieu, Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis, Neural Computation, vol.14, issue.3, pp.793-830, 2009.
DOI : 10.1016/j.sigpro.2007.01.024

A. Pedone, J. J. Burred, S. Maller, and P. Leveau, Phoneme-level text to audio synchronization on speech signals with background music, Proc. INTER- SPEECH, pp.433-436, 2011.

D. Ellis, Dynamic time warp (DTW) in Matlab. Web resource, 2003.

J. Garofolo, L. Lamel, W. Fisher, J. Fiscus, D. Pallett et al., Acoustic-phonetic continuous speech corpus, NIST, 1993.

E. Vincent, R. Gribonval, and C. Fevotte, Performance measurement in blind audio source separation, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.4, pp.1462-1469, 2006.
DOI : 10.1109/TSA.2005.858005

URL : https://hal.archives-ouvertes.fr/inria-00544230

V. Emiya, E. Vincent, N. Harlander, and V. Hohmann, Subjective and Objective Quality Assessment of Audio Source Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.7, pp.2046-2057
DOI : 10.1109/TASL.2011.2109381

URL : https://hal.archives-ouvertes.fr/inria-00485729

N. Seichepine, S. Essid, C. Févotte, and O. Cappé, Soft nonnegative matrix co-factorizationwith application to multimodal speaker diarization, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.3537-3541, 2013.
DOI : 10.1109/ICASSP.2013.6638316