N. Dehak, P. J. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, Front-End Factor Analysis for Speaker Verification, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.4, pp.788-798, 2011.
DOI : 10.1109/TASL.2010.2064307

C. S. Greenberg, D. Bansé, G. R. Doddington, D. Garcia-romero, J. J. Godfrey et al., The NIST 2014 Speaker Recognition I-Vector Machine Learning Challenge, Proc. of Odyssey: The Speaker and Language Recognition Workshop, pp.224-230, 2014.

D. Garcia-romero and C. Y. Espy-wilson, Analysis of Ivector length normalization in speaker recognition systems, Proc. of Interspeech, pp.249-252, 2011.

P. Bousquet, D. Matrouf, and J. Bonastre, Intersession Compensation and Scoring Methods in the i-vectors Space for Speaker Recognition, Proc. of Interspeech, pp.485-488, 2011.
URL : https://hal.archives-ouvertes.fr/hal-01313266

S. J. Prince and J. H. Elder, Probabilistic Linear Discriminant Analysis for Inferences About Identity, 2007 IEEE 11th International Conference on Computer Vision, pp.1-8, 2007.
DOI : 10.1109/ICCV.2007.4409052

D. D. Lee and H. S. Seung, Learning the parts of objects by non-negative matrix factorization, Nature, vol.401, issue.6755, pp.788-791, 1999.

A. Hurmalainen, R. Saeidi, and T. Virtanen, Noise Robust Speaker Recognition with Convolutive Sparse Coding, Proc. of Interspeech, 2015.

A. Hurmalainen, R. Saeidi, and T. Virtanen, Similarity induced group sparsity for non-negative matrix factorisation, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4425-4429, 2015.
DOI : 10.1109/ICASSP.2015.7178807

R. Saeidi, A. Hurmalainen, T. Virtanen, and A. Van-leeuwen, Exemplar-based Sparse Representation and Sparse Discrimination for Noise Robust Speaker Identification, Proc. of Odyssey 2012: The Speaker and Language Recognition Workshop, 2012.

N. Seichepine, S. Essid, C. Fevotte, and O. Cappe, Soft Nonnegative Matrix Co-Factorization, IEEE Transactions on Signal Processing, vol.62, issue.22, pp.5940-5949, 2014.
DOI : 10.1109/TSP.2014.2360141

URL : https://hal.archives-ouvertes.fr/hal-01116863

H. Lee and S. Choi, Group nonnegative matrix factorization for EEG classification, Proc. of AISTATS, pp.320-327, 2009.

R. Serizel, S. Essid, and G. Richard, Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.2016-5470
DOI : 10.1109/ICASSP.2016.7472723

J. Mairal, F. Bach, and J. Ponce, Task-Driven Dictionary Learning, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.4, pp.791-804, 2012.
DOI : 10.1109/TPAMI.2011.156

URL : https://hal.archives-ouvertes.fr/inria-00521534

V. Bisot, R. Serizel, S. Essid, and G. Richard, Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification HAL-archives ouvertes: working paper or preprint, 2016.

P. Sprechmann, A. M. Bronstein, and G. Sapiro, Supervised non-euclidean sparse NMF via bilevel optimization with applications to speech enhancement, 2014 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA), pp.11-15, 2014.
DOI : 10.1109/HSCMA.2014.6843241

D. D. Lee and H. S. Seung, Algorithms for non-negative matrix factorization, Proc. of NIPS, pp.556-562, 2000.

S. Kullback and R. Leibler, On information and sufficiency The annals of mathematical statistics, pp.79-86, 1951.

H. Zou and T. Hastie, Regularization and variable selection via the elastic net, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.5, issue.2, pp.301-320, 2005.
DOI : 10.1073/pnas.201162998

G. Gravier, J. Bonastre, E. Geoffrois, S. Galliano, K. M. Tait et al., ESTER, une campagne d'´ evaluation des systemes d'indexation automatique d'´ emissions radiophoniques en français, Proc. of Journées d'Etude sur la Parole, 2004.

M. Rouvier, G. Dupuy, P. Gay, and E. Khoury, An opensource state-of-the-art toolbox for broadcast news diarization, Proc. of Interspeech, 2013.
URL : https://hal.archives-ouvertes.fr/hal-01433449

B. Mathieu, S. Essid, T. Fillon, J. Prado, and G. Richard, YAAFE, an easy to use and efficient audio feature extraction software, Proc. of ISMIR, pp.441-446, 2010.

S. Davis and P. Mermelstein, Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.28, issue.4, pp.357-366, 1980.
DOI : 10.1109/TASSP.1980.1163420

F. Bastien, P. Lamblin, R. Pascanu, J. Bergstra, I. J. Goodfellow et al., Theano: new features and speed improvements, Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop, 2012.

J. Brown, spectral transform, The Journal of the Acoustical Society of America, vol.89, issue.1, pp.425-434, 1991.
DOI : 10.1121/1.400476

F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion et al., Scikit-learn: Machine learning in python, The Journal of Machine Learning Research, vol.12, pp.2825-2830, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00650905

J. Mairal, F. Bach, J. Ponce, and G. Sapiro, Online learning for matrix factorization and sparse coding, The Journal of Machine Learning Research, vol.11, pp.19-60, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00408716

Q. Mcnemar, Note on the sampling error of the difference between correlated proportions or percentages, Psychometrika, vol.12, issue.2, pp.153-157, 1947.
DOI : 10.1007/BF02295996