C. Blandin, A. Ozerov, and E. Vincent, Multi-source TDOA estimation in reverberant audio using angular spectra and clustering, Signal Processing, vol.92, issue.8, pp.1950-1960, 2012.
DOI : 10.1016/j.sigpro.2011.09.032
URL : https://hal.archives-ouvertes.fr/inria-00576297

A. S. Bregman, Auditory Scene Analysis, 1990.

J. L. Durrieu, B. David, and G. Richard, A Musically Motivated Mid-Level Representation for Pitch Estimation and Musical Audio Source Separation, IEEE Journal of Selected Topics in Signal Processing, vol.5, issue.6, pp.1180-1191, 2011.
DOI : 10.1109/JSTSP.2011.2158801

D. Fitzgerald, Vocal Separation using Nearest Neighbours and Median Filtering, IET Irish Signals and Systems Conference (ISSC 2012), 2012.
DOI : 10.1049/ic.2012.0225

D. Fitzgerald and M. Gainza, Single channel vocal separation using median filtering and factorisation techniques, ISAST Transactions on Electronic and Signal Processing, vol.4, issue.1, pp.62-73, 2010.

J. Foote, Visualizing music and audio using self-similarity, Proceedings of the seventh ACM international conference on Multimedia (Part 1), MULTIMEDIA '99, pp.77-80, 1999.
DOI : 10.1145/319463.319472
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.223.194

J. Foote and S. Uchihashi, The beat spectrum: a new approach to rhythm analysis, IEEE International Conference on Multimedia and Expo (ICME 2001), pp.881-884, 2001.
DOI : 10.1109/ICME.2001.1237863

C. L. Hsu and J. S. Jang, On the improvement of singing voice separation for monaural recordings using the MIR-1K dataset, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.2, pp.310-319, 2010.

A. Liutkus, Z. Rafii, R. Badeau, B. Pardo, and G. Richard, Adaptive filtering for music/voice separation exploiting the repeating musical structure, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012.
DOI : 10.1109/ICASSP.2012.6287815
URL : https://hal.archives-ouvertes.fr/hal-00945300

J. H. McDermott, D. Wrobleski, and A. J. Oxenham, Recovering sound sources from embedded repetition, Proceedings of the National Academy of Sciences, vol.108, issue.3, pp.1188-1193, 2011.
DOI : 10.1073/pnas.1004765108

F. Nesta and M. Matassoni, Robust automatic speech recognition through on-line semi blind source extraction, CHiME 2011 Workshop on Machine Listening in Multisource Environments, pp.18-23, 2011.

M. Piccardi, Background subtraction techniques: a review, IEEE International Conference on Systems, Man and Cybernetics, The Hague, The Netherlands, 2004.

Z. Rafii and B. Pardo, A simple music/voice separation system based on the extraction of the repeating musical structure, 36th International Conference on Acoustics, Speech and Signal Processing, 2011.

Z. Rafii and B. Pardo, Music/voice separation using the similarity matrix, 13th International Society for Music Information Retrieval Conference, 2012.

Z. Rafii and B. Pardo, Online REPET-SIM for real-time speech enhancement, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 2013.
DOI : 10.1109/ICASSP.2013.6637768

Z. Rafii and B. Pardo, REpeating Pattern Extraction Technique (REPET): A Simple Method for Music/Voice Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.21, issue.1, pp.71-82, 2013.
DOI : 10.1109/TASL.2012.2213249

Z. Rafii, D. L. Sun, F. G. Germain, and G. J. Mysore, Combining modeling of singing voice and background music for automatic separation of musical mixtures, 14th International Society for Music Information Retrieval Conference, 2013.

S. Rangachari and P. C. Loizou, A noise-estimation algorithm for highly non-stationary environments, Speech Communication, vol.48, issue.2, pp.220-231, 2006.
DOI : 10.1016/j.specom.2005.08.005

E. Rubin, Synsoplevede Figurer, Gyldendal, 1915.

Ö. Yilmaz and S. Rickard, Blind Separation of Speech Mixtures via Time-Frequency Masking, IEEE Transactions on Signal Processing, vol.52, issue.7, pp.1830-1847, 2004.
DOI : 10.1109/TSP.2004.828896