E. Vincent, S. Araki, F. Theis, G. Nolte, P. Bofill et al., The signal separation evaluation campaign (2007-2010): Achievements and remaining challenges, Signal Processing, vol.92, issue.8, pp.1928-1936, 2012.
DOI : 10.1016/j.sigpro.2011.10.007

URL : https://hal.archives-ouvertes.fr/inria-00579398

J. Ganseman, G. J. Mysore, J. S. Abel, and P. Scheunders, Source separation by score synthesis, Proc. Int. Computer Music Conference (ICMC), pp.462-465, 2010.

R. Hennequin, B. David, and R. Badeau, Score informed audio source separation using a parametric model of non-negative spectrogram, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.45-48, 2011.
DOI : 10.1109/ICASSP.2011.5946324

URL : https://hal.archives-ouvertes.fr/hal-00945294

U. Simsekli and A. T. Cemgil, Score guided musical source separation using generalized coupled tensor factorization, Proc. 20th European Signal Processing Conference (EUSIPCO), pp.2639-2643, 2012.

J. Fritsch and M. D. Plumbley, Score informed audio source separation using constrained nonnegative matrix factorization and score synthesis, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.888-891, 2013.
DOI : 10.1109/ICASSP.2013.6637776

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=

P. Smaragdis and G. J. Mysore, Separation by "humming": User-guided sound extraction from monophonic mixtures, Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp.69-72, 2009.
DOI : 10.1109/aspaa.2009.5346542

D. Fitzgerald, User assisted source separation using non-negative matrix factorisation, 22nd IET Irish Signals and Systems Conference, 2011.

J. L. Durrieu and J. P. Thiran, Musical Audio Source Separation Based on User-Selected F0 Track, Proc. Int. Conf. on Latent Variable Analysis and Signal Separation (LVA/ICA), pp.438-445, 2012.
URL : http://infoscience.epfl.ch/record/174056

A. Ozerov, C. Févotte, R. Blouet, and J. Durrieu, Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation, Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pp.257-260, 2011.
DOI : 10.1109/icassp.2011.5946389

URL : https://hal.archives-ouvertes.fr/inria-00564851

A. Lefèvre, F. Bach, and C. Févotte, Semi-supervised NMF with time-frequency annotations for single-channel source separation, Proc. Int. Symposium on Music Information Retrieval (ISMIR), pp.115-120, 2012.

B. Fuentes, R. Badeau, and G. Richard, Blind harmonic adaptive decomposition applied to supervised source separation, Proc. 20th European Signal Processing Conference (EUSIPCO), pp.2654-2658, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00945288

S. T. Roweis, One microphone source separation, Advances in Neural Information Processing Systems 13, pp.793-799, 2000.

W. Wang, D. Cosker, Y. Hicks, S. Sanei, and J. A. Chambers, Video Assisted Speech Source Separation, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '05), pp.425-428, 2005.
DOI : 10.1109/ICASSP.2005.1416331

G. J. Mysore and P. Smaragdis, A Non-negative Approach to Language Informed Speech Separation, Proc. Int. Conf. on Latent Variable Analysis and Signal Separation, pp.356-363, 2012.

M. Kim, J. Yoo, K. Kang, and S. Choi, Nonnegative Matrix Partial Co-Factorization for Spectral and Temporal Drum Source Separation, IEEE Journal of Selected Topics in Signal Processing, vol.5, issue.6, pp.1192-1204, 2011.
DOI : 10.1109/JSTSP.2011.2158803

T. Virtanen and A. Klapuri, Analysis of polyphonic audio using source-filter model and non-negative matrix factorization, Advances in Models for Acoustic Processing, Neural Information Processing Systems Workshop, 2006.

J. L. Durrieu, G. Richard, B. David, and C. Févotte, Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.564-575, 2010.
DOI : 10.1109/TASL.2010.2041114

A. Pedone, J. J. Burred, S. Maller, and P. Leveau, Phoneme-level text to audio synchronization on speech signals with background music, Proc. INTERSPEECH, pp.433-436, 2011.

C. Févotte, N. Bertin, and J. Durrieu, Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis, Neural Computation, vol.21, issue.3, pp.793-830, 2009.
DOI : 10.1162/neco.2008.04-08-771

A. Ozerov, E. Vincent, and F. Bimbot, A General Flexible Framework for the Handling of Prior Information in Audio Source Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.4, pp.1118-1133, 2012.
DOI : 10.1109/TASL.2011.2172425

URL : https://hal.archives-ouvertes.fr/inria-00536917

J. Garofolo, L. Lamel, W. Fisher, J. Fiscus, D. Pallett et al., DARPA TIMIT: Acoustic-phonetic continuous speech corpus, Tech. Rep., NIST, 1993.

C. P. Chen, J. Bilmes, and K. Kirchhoff, Low-resource noise-robust feature post-processing on Aurora 2.0, Proc. Int. Conf. on Spoken Language Processing (ICSLP), pp.2445-2448, 2002.

E. Vincent, R. Gribonval, and C. Févotte, Performance measurement in blind audio source separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.14, issue.4, pp.1462-1469, 2006.
DOI : 10.1109/TSA.2005.858005

URL : https://hal.archives-ouvertes.fr/inria-00544230

V. Emiya, E. Vincent, N. Harlander, and V. Hohmann, Subjective and Objective Quality Assessment of Audio Source Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.7, pp.2046-2057, 2011.
DOI : 10.1109/TASL.2011.2109381

URL : https://hal.archives-ouvertes.fr/inria-00567152