J. Cech, R. Mittal, A. Deleforge, J. Sanchez-riera, X. Alameda-pineda et al., Active-speaker detection and localization with microphones and cameras embedded into a robotic head, 2013 13th IEEE-RAS International Conference on Humanoid Robots (Humanoids), 2013.
DOI : 10.1109/HUMANOIDS.2013.7029977
URL : https://hal.archives-ouvertes.fr/hal-00861465

M. S. Datum, F. Palmieri, and A. Moiseff, An artificial neural network for sound localization using binaural cues, The Journal of the Acoustical Society of America, vol.100, issue.1, pp.372-383, 1996.
DOI : 10.1121/1.415854

V. Willert, J. Eggert, J. Adamy, R. Stahl, and E. Koerner, A Probabilistic Model for Binaural Sound Localization, IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics), vol.36, issue.5, pp.982-994, 2006.
DOI : 10.1109/TSMCB.2006.872263
URL : http://www1.rtr.tu-darmstadt.de/pdf/willert-eggert-2-2006.pdf

A. Kulaib, M. Mualla, and D. Vernon, 2D Binaural Sound Localization: for Urban Search and Rescue Robotics, Mobile Robotics, pp.9-11, 2009.
DOI : 10.1142/9789814291279_0053

J. Hörnstein, M. Lopes, J. Santos-victor, and F. Lacerda, Sound Localization for Humanoid Robots - Building Audio-Motor Maps based on the HRTF, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp.1170-1176, 2006.
DOI : 10.1109/IROS.2006.281849

Y. Lu and M. Cooke, Binaural estimation of sound source distance via the direct-to-reverberant energy ratio for static and moving sources, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, pp.1793-1805, 2010.

M. Raspaud, H. Viste, and G. Evangelista, Binaural Source Localization by Joint Estimation of ILD and ITD, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.1, pp.68-77, 2010.
DOI : 10.1109/TASL.2009.2023644

R. Talmon, I. Cohen, and S. Gannot, Supervised source localization using diffusion kernels, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp.245-248, 2011.
DOI : 10.1109/ASPAA.2011.6082267

A. Deleforge and R. Horaud, 2D sound-source localization on the binaural manifold, 2012 IEEE International Workshop on Machine Learning for Signal Processing, 2012.
DOI : 10.1109/MLSP.2012.6349784
URL : https://hal.archives-ouvertes.fr/hal-00768657

Y. Luo, D. N. Zotkin, and R. Duraiswami, Gaussian process models for HRTF based 3D sound localization, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014.
DOI : 10.1109/ICASSP.2014.6854122
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.460.2995

F. Keyrouz, Advanced Binaural Sound Localization in 3-D for Humanoid Robots, IEEE Transactions on Instrumentation and Measurement, vol.63, issue.9, 2014.
DOI : 10.1109/TIM.2014.2308051

O. Y?lmaz and S. Rickard, Blind Separation of Speech Mixtures via Time-Frequency Masking, IEEE Transactions on Signal Processing, vol.52, issue.7, pp.1830-1847, 2004.
DOI : 10.1109/TSP.2004.828896

P. Aarabi, Self-localizing dynamic microphone arrays, IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews), vol.32, issue.4, pp.474-484, 2002.
DOI : 10.1109/TSMCB.2002.804369
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.4.6131

N. Roman, D. Wang, and G. J. Brown, Speech segregation based on sound localization, The Journal of the Acoustical Society of America, vol.114, issue.4, pp.2236-2252, 2003.
DOI : 10.1121/1.1610463
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.22.5314

N. Roman and D. Wang, Binaural Tracking of Multiple Moving Sources, IEEE Transactions on Audio, Speech, and Language Processing, vol.16, issue.4, pp.728-739, 2008.
DOI : 10.1109/TASL.2008.918978

M. I. Mandel, R. J. Weiss, and D. P. Ellis, Model-Based Expectation-Maximization Source Separation and Localization, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.2, pp.382-394, 2010.
DOI : 10.1109/TASL.2009.2029711

S. Lee and H. Park, Multiple reverberant sound localization based on rigorous zero-crossing-based ITD selection, IEEE Signal Processing Letters, vol.17, issue.7, pp.671-674, 2010.

J. Woodruff and D. Wang, Binaural Localization of Multiple Sources in Reverberant and Noisy Environments, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.5, pp.1503-1512, 2012.
DOI : 10.1109/TASL.2012.2183869

A. Deleforge, F. Forbes, and R. Horaud, Variational EM for binaural sound-source separation and localization, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 2013.
DOI : 10.1109/ICASSP.2013.6637612
URL : https://hal.archives-ouvertes.fr/hal-00823453

J. Woodruff and D. Wang, Sequential Organization of Speech in Reverberant Environments by Integrating Monaural Grouping and Binaural Localization, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1856-1866, 2010.
DOI : 10.1109/TASL.2010.2050087

P. Viola and M. J. Jones, Robust Real-Time Face Detection, International Journal of Computer Vision, vol.57, issue.2, pp.137-154, 2004.
DOI : 10.1023/B:VISI.0000013087.49260.fb
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.102.9805

A. Deleforge, F. Forbes, and R. Horaud, Acoustic Space Learning for Sound-Source Separation and Localization on Binaural Manifolds, International Journal of Neural Systems, vol.25, issue.01, 2015.
DOI : 10.1142/S0129065714400036
URL : https://hal.archives-ouvertes.fr/hal-00960796

R. D. Cook, Fisher Lecture: Dimension Reduction in Regression, Statistical Science, vol.22, issue.1, pp.1-26, 2007.
DOI : 10.1214/088342306000000682

A. Deleforge, F. Forbes, and R. Horaud, High-dimensional regression with gaussian mixtures and partially-latent response variables, Statistics and Computing, vol.19, issue.11, 2014.
DOI : 10.1007/s11222-014-9461-5
URL : https://hal.archives-ouvertes.fr/hal-01107604

K. C. Li, Sliced Inverse Regression for Dimension Reduction, Journal of the American Statistical Association, vol.13, issue.414, pp.316-327, 1991.
DOI : 10.1214/aos/1176345514

A. Deleforge, V. Drouard, L. Girin, and R. Horaud, Mapping sounds on images using binaural spectrograms, 22nd European Signal Processing Conference, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01019287

X. Zhu and D. Ramanan, Face detection, pose estimation, and landmark localization in the wild, IEEE Conference on Computer Vision and Pattern Recognition, 2012.

J. C. Middlebrooks and D. M. Green, Sound Localization by Human Listeners, Annual Review of Psychology, vol.42, issue.1, pp.135-159, 1991.
DOI : 10.1146/annurev.ps.42.020191.001031

K. Youssef, S. Argentieri, and J. Zarader, Towards a systematic study of binaural cues, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp.1004-1009, 2012.
DOI : 10.1109/IROS.2012.6385554

M. Aytekin, C. F. Moss, and J. Z. Simon, A Sensorimotor Approach to Sound Localization, Neural Computation, vol.79, issue.2, pp.603-635, 2008.
DOI : 10.1523/JNEUROSCI.0199-04.2004

M. Otani, T. Hirahara, and S. Ise, Numerical study on source-distance dependency of head-related transfer functions, The Journal of the Acoustical Society of America, vol.125, issue.5, pp.3253-61, 2009.
DOI : 10.1121/1.3111860

M. Jordan and R. Jacobs, Hierarchical Mixtures of Experts and the EM Algorithm, Neural Computation, vol.26, issue.2, pp.181-214, 1994.
DOI : 10.1214/aos/1176346060

Y. Qiao and N. Minematsu, Mixture of Probabilistic Linear Regressions: A unified view of GMM-based mapping techiques, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.3913-3916, 2009.
DOI : 10.1109/ICASSP.2009.4960483

J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, and D. S. Pallett, The DARPA TIMIT acoustic-phonetic continuous speech corpus CD- ROM, National Institute of Standards and Technology, 1993.

C. Knapp and G. C. Carter, The generalized correlation method for estimation of time delay, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.24, issue.4, pp.320-327, 1976.
DOI : 10.1109/TASSP.1976.1162830