Y. Boykov and G. Funka-lea, Graph Cuts and Efficient N-D Image Segmentation, International Journal of Computer Vision, vol.18, issue.9, pp.109-131, 2006.
DOI : 10.1007/s11263-006-7934-5

A. L. Casanovas, Blind audiovisual source separation using sparse redundant representations, Signal Processing Institute, 2006.

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.886-893, 2005.
DOI : 10.1109/CVPR.2005.177
URL : https://hal.archives-ouvertes.fr/inria-00548512

T. Darrell, J. W. Fisher, and I. , Speaker association with signal-level audiovisual fusion, IEEE Trans. on Multimedia, vol.6, issue.3, pp.406-413, 2004.

J. Driver, Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading, Nature, vol.381, issue.6577, pp.66-68, 1996.
DOI : 10.1038/381066a0

J. Hershey and J. R. Movellan, Audio vision: Using audiovisual synchrony to locate sounds, NIPS, pp.813-819, 1999.

K. Kanatani, Motion segmentation by subspace separation and model selection, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001, pp.586-591, 2001.
DOI : 10.1109/ICCV.2001.937679

E. Kidron, Y. Y. Schechner, and M. Elad, Pixels that Sound, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.88-95, 2005.
DOI : 10.1109/CVPR.2005.274

B. Lucas and T. Kanade, An Iterative Image Registration Technique with an Application to Stereo Vision, Int'l Joint Conf. on Artificial Intelligence, pp.674-679, 1981.

J. Luettin, N. Thacker, and S. Beet, Speaker identification by lipreading, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, pp.62-65, 1996.
DOI : 10.1109/ICSLP.1996.607030
URL : http://infoscience.epfl.ch/record/82364

G. Monaci, O. Escoda, and P. Vandergheynst, Analysis of multimodal signals using redundant representations, IEEE International Conference on Image Processing 2005, pp.145-148, 2005.
DOI : 10.1109/ICIP.2005.1530349

E. Parzen, On Estimation of a Probability Density Function and Mode, The Annals of Mathematical Statistics, vol.33, issue.3, pp.1065-1076, 1962.
DOI : 10.1214/aoms/1177704472

E. K. Patterson, S. Gurbuz, Z. Tufekci, and J. N. Gowdy, Moving-talker, speakerindependent feature study and baseline results using the cuave multimodal speech corpus, EURASIP J. on Applied Signal Processing, issue.11, pp.1189-1201, 2002.

L. Rabiner and B. Juang, Fundamentals of Speech Recognition, 1993.

A. Renyi, On measures of Entropy and Information, Fourth Berkeley Symp, pp.547-561, 1961.

P. Viola and M. Jones, Robust Real-Time Face Detection, International Journal of Computer Vision, vol.57, issue.2, pp.137-154, 2004.
DOI : 10.1023/B:VISI.0000013087.49260.fb
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.102.9805

D. Xu, J. Principe, and J. Fisher, A Novel Measure for Independent Component Analysis (ICA), Int'l Conf. on Acoustics, Speech and Signal Processing, pp.1161-164, 1998.