A. Miro, X. Bozonnet, S. Evans, N. Fredouille, C. Friedland et al., Speaker Diarization: A Review of Recent Research, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.2, pp.356-370, 2012.
DOI : 10.1109/TASL.2011.2125954

S. H. Bae and K. J. Yoon, Robust Online Multi-object Tracking Based on Tracklet Confidence and Online Discriminative Appearance Learning, 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp.1218-1225, 2014.
DOI : 10.1109/CVPR.2014.159

A. Deleforge, R. Horaud, Y. Y. Schechner, and L. Girin, Co-Localization of Audio Sources in Images Using Binaural Features and Locally-Linear Regression, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, issue.4, pp.718-731, 2015.
DOI : 10.1109/TASLP.2015.2405475

URL : https://hal.archives-ouvertes.fr/hal-01112834

D. Gatica-perez, G. Lathoud, J. M. Odobez, and I. Mccowan, Audiovisual Probabilistic Tracking of Multiple Speakers in Meetings, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.2, pp.601-616, 2007.
DOI : 10.1109/TASL.2006.881678

E. Kidron, Y. Y. Schechner, and M. Elad, Cross-Modal Localization via Sparsity, IEEE Transactions on Signal Processing, vol.55, issue.4, pp.1390-1404, 2007.
DOI : 10.1109/TSP.2006.888095

S. Naqvi, M. Yu, and J. Chambers, A Multimodal Approach to Blind Source Separation of Moving Sources, IEEE Journal of Selected Topics in Signal Processing, vol.4, issue.5, pp.895-910, 2010.
DOI : 10.1109/JSTSP.2010.2057198

A. Noulas, G. Englebienne, and B. J. Krose, Multimodal Speaker Diarization, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.1, pp.79-93, 2012.
DOI : 10.1109/TPAMI.2011.47

G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. W. Senior, Recent advances in the automatic recognition of audiovisual speech, Proceedings of the IEEE, vol.91, issue.9, pp.1306-1326, 2003.
DOI : 10.1109/JPROC.2003.817150

J. Sohn, N. S. Kim, and W. Sung, A statistical model-based voice activity detection, IEEE Signal Processing Letters, vol.6, issue.1, pp.1-3, 1999.
DOI : 10.1109/97.736233