Speaker Diarization: A Review of Recent Research, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.2, pp.356-370, 2012.
DOI : 10.1109/TASL.2011.2125954
Robust Online Multi-object Tracking Based on Tracklet Confidence and Online Discriminative Appearance Learning, 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp.1218-1225, 2014.
DOI : 10.1109/CVPR.2014.159
Co-Localization of Audio Sources in Images Using Binaural Features and Locally-Linear Regression, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, issue.4, pp.718-731, 2015.
DOI : 10.1109/TASLP.2015.2405475
URL : https://hal.archives-ouvertes.fr/hal-01112834
Audiovisual Probabilistic Tracking of Multiple Speakers in Meetings, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.2, pp.601-616, 2007.
DOI : 10.1109/TASL.2006.881678
Cross-Modal Localization via Sparsity, IEEE Transactions on Signal Processing, vol.55, issue.4, pp.1390-1404, 2007.
DOI : 10.1109/TSP.2006.888095
A Multimodal Approach to Blind Source Separation of Moving Sources, IEEE Journal of Selected Topics in Signal Processing, vol.4, issue.5, pp.895-910, 2010.
DOI : 10.1109/JSTSP.2010.2057198
Multimodal Speaker Diarization, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.1, pp.79-93, 2012.
DOI : 10.1109/TPAMI.2011.47
Recent advances in the automatic recognition of audiovisual speech, Proceedings of the IEEE, vol.91, issue.9, pp.1306-1326, 2003.
DOI : 10.1109/JPROC.2003.817150
A statistical model-based voice activity detection, IEEE Signal Processing Letters, vol.6, issue.1, pp.1-3, 1999.
DOI : 10.1109/97.736233