F. Bach, Bolasso, Proceedings of the 25th international conference on Machine learning, ICML '08, p.33, 2008.
DOI : 10.1145/1390156.1390161
URL : https://hal.archives-ouvertes.fr/hal-00271289

F. Bach, Self-concordant analysis for logistic regression, Electronic Journal of Statistics, vol.4, p.384, 2010.
DOI : 10.1214/09-EJS521
URL : https://hal.archives-ouvertes.fr/hal-00426227

L. Breiman, Random forests, Machine Learning, vol.45, issue.1, p.5, 2001.

E. J. Candes, J. K. Romberg, and T. Tao, Stable signal recovery from incomplete and inaccurate measurements, Communications on Pure and Applied Mathematics, vol.59, issue.8, p.1207, 2006.
DOI : 10.1002/cpa.20124

M. K. Carroll, G. A. Cecchi, I. Rish, R. Garg, and A. Rao, Prediction and interpretation of distributed neural activity with sparse models, NeuroImage, vol.44, issue.1, p.112, 2009.
DOI : 10.1016/j.neuroimage.2008.08.020

C. Chang and C. Lin, LIBSVM, ACM Transactions on Intelligent Systems and Technology, vol.2, issue.3, p.27, 2011.
DOI : 10.1145/1961189.1961199

R. E. Fan, K. W. Chang, C. J. Hsieh, X. R. Wang, and C. J. Lin, LIBLINEAR: A library for large linear classification, J. Mach. Learn. Res, vol.9, p.1871, 2008.

E. Grave, G. R. Obozinski, and F. Bach, Trace lasso: a trace norm regularization for correlated designs, Adv NIPS, p.2195, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00620197

A. Haury, P. Gestraud, and J. Vert, The Influence of Feature Selection Methods on Accuracy, Stability and Interpretability of Molecular Signatures, PLoS ONE, vol.6, issue.12, p.28210, 2011.
DOI : 10.1371/journal.pone.0028210
URL : https://hal.archives-ouvertes.fr/hal-00559580

J. V. Haxby, M. I. Gobbini, and M. L. Furey, Distributed and Overlapping Representations of Faces and Objects in Ventral Temporal Cortex, Science, vol.293, issue.5539, p.2425, 2001.
DOI : 10.1126/science.1063736

L. Jacob, G. Obozinski, and J. P. Vert, Group lasso with overlap and graph lasso, Proceedings of the 26th Annual International Conference on Machine Learning, ICML '09, p.433, 2009.
DOI : 10.1145/1553374.1553431
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.149.7108

J. Jia and B. Yu, On model selection consistency of the elastic net when p ≫ n, Statistica Sinica, vol.20, p.595, 2010.

K. Jimura and R. Poldrack, Analyses of regional-average activation and multivoxel pattern information tell complementary stories, Neuropsychologia, vol.50, issue.4, p.544, 2012.
DOI : 10.1016/j.neuropsychologia.2011.11.007

S. Mallat and Z. Zhang, Matching pursuits with time-frequency dictionaries, IEEE Transactions on Signal Processing, vol.41, issue.12, p.3397, 1993.
DOI : 10.1109/78.258082
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.335.5769

N. Meinshausen and P. Bühlmann, Stability selection, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.72, issue.4, p.417, 2010.
DOI : 10.1111/j.1467-9868.2010.00740.x

V. Michel, A. Gramfort, and G. Varoquaux, A supervised clustering approach for fMRI-based inference of brain states, Pattern Recognition, vol.45, issue.6, p.2041, 2012.
DOI : 10.1016/j.patcog.2011.04.006
URL : https://hal.archives-ouvertes.fr/inria-00589201

T. M. Mitchell, R. Hutchinson, R. S. Niculescu, and F. Pereira, Learning to Decode Cognitive States from Brain Images, Machine Learning, vol.57, issue.1/2, p.145, 2004.
DOI : 10.1023/B:MACH.0000035475.85309.1b

S. Monti, P. Tamayo, J. Mesirov, and T. Golub, Consensus clustering: a resampling-based method for class discovery and visualization of gene expression microarray data, Machine Learning, vol.52, issue.1/2, p.91, 2003.
DOI : 10.1023/A:1023949509487

D. Müllner, Modern hierarchical, agglomerative clustering algorithms, arXiv preprint arXiv:1109, 2011.

A. Ng, Feature selection, L1 vs. L2 regularization, and rotational invariance, Twenty-first international conference on Machine learning, ICML '04, p.78, 2004.
DOI : 10.1145/1015330.1015435

F. Pedregosa, G. Varoquaux, and A. Gramfort, Scikit-learn: Machine learning in Python, J. Mach. Learn. Res, vol.12, p.2825, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00650905

R. Tibshirani, Regression shrinkage and selection via the lasso, J. Roy. Statistical Society B, vol.58, p.267, 1996.

S. M. Tom, C. R. Fox, C. Trepel, and R. Poldrack, The Neural Basis of Loss Aversion in Decision-Making Under Risk, Science, vol.315, issue.5811, p.515, 2007.
DOI : 10.1126/science.1134239

J. A. Tropp, Greed is Good: Algorithmic Results for Sparse Approximation, IEEE Transactions on Information Theory, vol.50, issue.10, p.2231, 2004.
DOI : 10.1109/TIT.2004.834793

M. Wainwright, Sharp Thresholds for High-Dimensional and Noisy Sparsity Recovery Using $\ell_1$-Constrained Quadratic Programming (Lasso), IEEE Transactions on Information Theory, vol.55, issue.5, p.2183, 2009.
DOI : 10.1109/TIT.2009.2016018

M. Yuan and Y. Lin, Model selection and estimation in regression with grouped variables, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.68, issue.1, p.49, 2006.
DOI : 10.1111/j.1467-9868.2005.00532.x

P. Zhao and B. Yu, On model selection consistency of lasso, J. Mach. Learn. Res, vol.7, p.2541, 2006.

H. Zou, The Adaptive Lasso and Its Oracle Properties, Journal of the American Statistical Association, vol.101, issue.476, p.1418, 2006.
DOI : 10.1198/016214506000000735

H. Zou and T. Hastie, Regularization and variable selection via the elastic net, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.67, issue.2, p.301, 2005.
DOI : 10.1111/j.1467-9868.2005.00503.x