A. L. Wang, An industrial-strength audio search algorithm, Proc. Int. Sym. on Music Information Retrieval (ISMIR), pp.1-4, 2003.

J. Haitsma and T. Kalker, A highly robust audio fingerprinting system, Proc. Int. Sym. on Music Information Retrieval (ISMIR), 2002.

M. Fink, M. Covell, and S. Baluja, Social-and interactive-television. applications based on real-time ambient-audio identification, Proc. European Interactive TV Conference (Euro-ITV), 2006.

C. Howson, E. Gautier, P. Gilberton, A. Laurent, and Y. Legallais, Second screen TV synchronization, 2011 IEEE International Conference on Consumer Electronics -Berlin (ICCE-Berlin), 2011.
DOI : 10.1109/ICCE-Berlin.2011.6031815

P. Cano, E. Batlle, T. Kalker, and J. Haitsma, A review of algorithms for audio fingerprinting, 2002 IEEE Workshop on Multimedia Signal Processing., pp.169-173, 2002.
DOI : 10.1109/MMSP.2002.1203274

H. J. Kim, Y. H. Choi, J. W. Seok, and J. W. Hong, Audio Watermarking Techniques, Intelligent Watermarking Techniques, pp.185-218, 2004.
DOI : 10.1142/9789812562524_0008

R. Macrae, X. Anguera, and N. Oliver, MuViSync: Realtime music video alignment, 2010 IEEE International Conference on Multimedia and Expo, 2010.
DOI : 10.1109/ICME.2010.5583863

N. Q. Duong, C. Howson, and Y. Legallais, Fast second screen TV synchronization combining audio fingerprint technique and generalized cross correlation, 2012 IEEE Second International Conference on Consumer Electronics, Berlin (ICCE-Berlin), pp.2012-241
DOI : 10.1109/ICCE-Berlin.2012.6336458

N. Q. Duong and F. Thudor, Movie synchronization by audio landmark matching, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.2013-3632
DOI : 10.1109/ICASSP.2013.6638335

URL : https://hal.archives-ouvertes.fr/hal-01289064

C. V. Cotton and D. P. Ellis, Audio fingerprinting to identify multiple videos of an event, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.2386-2389, 2010.
DOI : 10.1109/ICASSP.2010.5496185

Z. Rafii, B. Coover, and J. Han, An audio fingerprinting system for live version identification using image processing techniques, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014.
DOI : 10.1109/ICASSP.2014.6853675

P. Cano, E. Batlle, T. Kalker, and J. Haitsma, A Review of Audio Fingerprinting, Journal of VLSI signal processing systems for signal, image and video technology, vol.33, issue.3, pp.271-284, 2005.
DOI : 10.1007/s11265-005-4151-3

G. Richard, S. Sundaram, and S. Narayanan, An Overview on Perceptually Motivated Audio Indexing and Classification, Proceedings of the IEEE, vol.101, issue.9, pp.1939-1954, 2013.
DOI : 10.1109/JPROC.2013.2251591

A. Ramalingam, S. Krishnan-]-e, J. Batlle, E. Masip, P. Guaus et al., Gaussian mixture modeling of shorttime fourier transform features for audio fingerprinting Scalability issues in hmm-based audio fingerprinting, Proc. IEEE Int. Conference on Multimedia and Expo, pp.457-463, 2004.

V. Chandrasekhar, M. Sharifi, and D. A. Ross, Survey and evaluation of audio fingerprinting schemes for mobile query-by-example applications, 12th International Society for Music Information Retrieval Conference (ISMIR), pp.801-806, 2011.

Y. Ke, D. Hoiem, and R. Sukthankar, Computer vision for music identification, Proc. Int. Conf. on Computer Vision and Pattern Recognition (CVPR), pp.597-604, 2005.

S. Baluja and M. Covell, Audio Fingerprinting: Combining Computer Vision & Data Stream Processing, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, 2007.
DOI : 10.1109/ICASSP.2007.366210

M. Ramona and G. Peeters, Automatic alignment of audio occurrences: application to the verification and synchronization of audio fingerprinting annotation, Proc. DAFX, pp.429-436, 2011.

E. Dupraz and G. Richard, Robust frequency-based Audio Fingerprinting, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.2091-2094, 2010.
DOI : 10.1109/ICASSP.2010.5495944

M. Moussallam and L. Daudet, A general framework for dictionary based audio fingerprinting, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.3077-3081, 2014.
DOI : 10.1109/ICASSP.2014.6854166

J. S. Seo, M. Jin, S. Lee, D. Jang, S. Lee et al., Audio Fingerprinting Based on Normalized Spectral Subband Centroids, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., pp.213-216, 2005.
DOI : 10.1109/ICASSP.2005.1415684

K. Seyerlehner, M. Schedl, P. Knees, and R. Sonnleitner, A refined block-level feature set for classification, similarity and tag prediction, Proc. Music Information Retrieval Evaluation eXchange (MIREX), 2011.

X. Anguera, A. Garzon, and T. Adamek, MASK: Robust Local Features for Audio Fingerprinting, 2012 IEEE International Conference on Multimedia and Expo, pp.455-460, 2012.
DOI : 10.1109/ICME.2012.137

P. Cano, E. Batlle, H. Mayer, and H. Neuschmied, Robust sound modeling for song detection in broadcast audio, Proc. 112th Audio Engineering Society Convention (AES), 2002.

J. Hao, T. Lee, and T. J. Sejnowski, Speech enhancement using gaussian scale mixture models, IEEE Trans. on Audio Speech and Language Processing, issue.18, pp.1127-1136, 2010.

C. J. Burges, J. C. Platt, and S. Jana, Distortion discriminant analysis for audio fingerprinting, IEEE Transactions on Speech and Audio Processing, vol.11, issue.3, pp.165-174, 2003.
DOI : 10.1109/TSA.2003.811538

J. Deng, W. Wan, X. Yu, and W. Yang, Audio fingerprinting based on spectral energy structure and nmf, Proc. Int. Conf. on Communication Technology (ICCT), pp.1103-1106, 2011.

B. Logan, Mel frequency cepstral coefficients for music modeling, Proc. Int. Sym. on Music Information Retrieval (ISMIR), 2002.

A. Bagri, F. Thudor, A. Ozerov, and P. Hellier, A scalable framework for joint clustering and synchronizing multi-camera videos, Proc. European Signal Processing Conference (EUSIPCO), pp.1-5, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00870381

C. Yang, Macs: music audio characteristic sequence indexing for similarity retrieval, Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp.123-126, 2001.

J. Ogle and D. Ellis, Fingerprinting to Identify Repeated Sound Events in Long-Duration Personal Audio Recordings, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, pp.233-236, 2011.
DOI : 10.1109/ICASSP.2007.366659

N. J. Bryan, P. Smaragdis, and G. J. Mysore, Clustering and synchronizing multi-camera video via landmark cross-correlation, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.2012-2389
DOI : 10.1109/ICASSP.2012.6288396

M. Ramona and G. Peeters, Audio identification based on spectral modeling of bark-bands energy and synchronization through onset detection, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.477-480, 2011.
DOI : 10.1109/ICASSP.2011.5946444

URL : https://hal.archives-ouvertes.fr/hal-01161269

E. Allamanche, J. Herre, and O. Hellmuth, Content-based identification of audio material using mpeg-7 low level description, Proc. Int. Sym. on Music Information Retrieval (ISMIR), 2002.

J. Herre, E. Allamanche, and O. Hellmuth, Robust matching of audio signals using spectral flatness features, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575), pp.127-130, 2001.
DOI : 10.1109/ASPAA.2001.969559

D. Reynolds and R. Rose, Robust text-independent speaker identification using Gaussian mixture speaker models, IEEE Transactions on Speech and Audio Processing, vol.3, issue.1, pp.72-83, 1995.
DOI : 10.1109/89.365379

L. Benaroya, F. Bimbot, and R. Gribonval, Audio source separation with a single sensor, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.1, pp.191-199, 2006.
DOI : 10.1109/TSA.2005.854110

URL : https://hal.archives-ouvertes.fr/inria-00544949

L. R. Rabiner, A tutorial on hmm and selected applications in speech recognition, Proceeding of the IEEE, pp.257-286, 1989.

D. D. Lee and H. S. Seung, Learning the parts of objects with nonnegative matrix factorization On-the-fly audio source separation, IEEE Int. Workshop on Machine Learning for Signal Processing (MLSP), pp.788-791, 1999.

N. Q. Duong, A. Ozerov, L. Chevallier, and J. Sirot, An interactive audio source separation framework based on nonnegative matrix factorization, IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp.2014-1586
URL : https://hal.archives-ouvertes.fr/hal-00960717

C. Févotte, N. Bertin, and J. Durrieu, Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis, Neural Computation, vol.14, issue.3, pp.793-830, 2009.
DOI : 10.1016/j.sigpro.2007.01.024

N. Chen, H. Xiao, and W. Wan, Audio hash function based on nonnegative matrix factorisation of mel-frequency cepstral coefficients, IET Information Security, issue.1, pp.19-25, 2011.