The Ravel data set, 2011. ,
URL : https://hal.archives-ouvertes.fr/inria-00614483
Using Bayes' Rule to Model Multisensory Enhancement in the Superior Colliculus, Neural Computation, vol.53, issue.3, pp.1165-1187, 2000. ,
DOI : 10.1016/S0079-6123(08)63337-3
The CAVA corpus, Proceedings of the 10th international conference on Multimodal interfaces, IMCI '08, pp.109-116, 2008. ,
DOI : 10.1145/1452392.1452414
URL : https://hal.archives-ouvertes.fr/inria-00373173
A graphical model for audiovisual object tracking, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.25, issue.7, pp.828-836, 2003. ,
DOI : 10.1109/TPAMI.2003.1206512
Hypothesis testing for evaluating a multimodal pattern recognition framework applied to speaker detection, Journal of NeuroEngineering and Rehabilitation, vol.5, issue.1, p.11, 2008. ,
DOI : 10.1186/1743-0003-5-11
Extraction of audio features specific to speech production for multimodal speaker detection. Multimedia, IEEE Transactions on, vol.10, issue.1, pp.63-73, 2008. ,
Pattern Recognition and Machine Learning, 2006. ,
Multiple person and speaker activity tracking with a particle filter, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.881-884, 2004. ,
DOI : 10.1109/ICASSP.2004.1327252
Integrating pitch and localisation cues at a speech fragment level, Proc. of Interspeech, pp.2769-2772, 2007. ,
A cluster separation measure. Pattern Analysis and Machine Intelligence, IEEE Transactions, issue.12, pp.224-227, 1979. ,
Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society. Series B (Methodological), vol.39, issue.1, pp.1-38, 1977. ,
Speaker Association With Signal-Level Audiovisual Fusion, IEEE Transactions on Multimedia, vol.6, issue.3, pp.406-413, 2004. ,
DOI : 10.1109/TMM.2004.827503
Audiovisual Probabilistic Tracking of Multiple Speakers in Meetings, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.2, pp.601-616, 2007. ,
DOI : 10.1109/TASL.2006.881678
A Combined Corner and Edge Detector, Procedings of the Alvey Vision Conference 1988, pp.147-151, 1988. ,
DOI : 10.5244/C.2.23
Methods for merging Gaussian mixture components Advances in Data Analysis and Classification, pp.3-34, 2010. ,
Structure Inference for Bayesian Multisensory Scene Understanding, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.30, issue.12, pp.2140-2157, 2008. ,
DOI : 10.1109/TPAMI.2008.25
Nonparametric Hypothesis Tests for Statistical Dependency, IEEE Transactions on Signal Processing, vol.52, issue.8, pp.2234-2249, 2004. ,
DOI : 10.1109/TSP.2004.830994
Estimation consistante de l'ordre de modèles de mélange. Comptes Rendus de l'Académie des Sciences - Series I -Mathematics, pp.243-248, 1998. ,
Conjugate Mixture Models for the Modeling of Visual and Auditory Perception, 2010. ,
URL : https://hal.archives-ouvertes.fr/tel-00584080
Detection and localization of 3d audio-visual objects using unsupervised clustering, Proceedings of the 10th international conference on Multimodal interfaces, IMCI '08, pp.217-224, 2008. ,
DOI : 10.1145/1452392.1452438
URL : https://hal.archives-ouvertes.fr/inria-00373148
Conjugate Mixture Models for Clustering Multimodal Data, Neural Computation, vol.49, issue.3, pp.517-557, 2011. ,
DOI : 10.1007/978-94-011-3436-1
URL : https://hal.archives-ouvertes.fr/inria-00590267
A mixture model and em-based algorithm for class discovery, robust classification, and outlier rejection in mixed labeled/unlabeled data sets. Pattern Analysis and Machine Intelligence, IEEE Transactions on, issue.11, pp.251468-1483, 2003. ,
A joint particle filter for audio-visual speaker tracking, Proceedings of the 7th international conference on Multimodal interfaces , ICMI '05, pp.61-68, 2005. ,
DOI : 10.1145/1088463.1088477
Text classification from labeled and unlabeled documents using EM, Machine Learning, vol.39, issue.2/3, pp.103-134, 2000. ,
DOI : 10.1023/A:1007692713085
The topography of multivariate normal mixtures. The Annals of Statistics, pp.2042-2065, 2005. ,
Estimating the Dimension of a Model, The Annals of Statistics, vol.6, issue.2, pp.461-464, 1978. ,
DOI : 10.1214/aos/1176344136
Joint Audio-Visual Tracking Using Particle Filters, EURASIP Journal on Advances in Signal Processing, vol.2002, issue.11, pp.1154-1164, 2002. ,
DOI : 10.1155/S1110865702206058
URL : http://doi.org/10.1155/s1110865702206058