F. L. Lewis, Wireless sensor networks, Smart Environments: Technologies, Protocols, and Applications, pp.11-46, 2004.

M. Armbrust, A. Fox, R. Griffith, A. D. Joseph, R. H. Katz et al., A view of cloud computing, Communications of the ACM, vol.53, issue.4, pp.50-58, 2010.
DOI : 10.1145/1721654.1721672

A. C. Yao, Some complexity questions related to distributive computing, STOC, pp.209-213, 1979.
DOI : 10.1145/800135.804414

H. Abelson, Lower Bounds on Information Transfer in Distributed Computations, Journal of the ACM, vol.27, issue.2, pp.384-392, 1980.
DOI : 10.1145/322186.322200

H. Daumé III, J. M. Phillips, A. Saha, and S. Venkatasubramanian, Protocols for learning classifiers on distributed data, AISTATS, pp.282-290, 2012.

O. Shamir, Fundamental Limits of Online and Distributed Algorithms for Statistical Learning and Estimation, NIPS, 2014.

A. Lazarevic and Z. Obradovic, Boosting Algorithms for Parallel and Distributed Learning, Distributed and Parallel Databases, pp.203-229, 2002.

P. A. Forero, A. Cano, and G. B. Giannakis, Consensus-based distributed linear support vector machines, Proceedings of the 9th ACM/IEEE International Conference on Information Processing in Sensor Networks, IPSN '10, pp.1663-1707, 2010.
DOI : 10.1145/1791212.1791218
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.591.7247

S. P. Boyd, N. Parikh, E. Chu, B. Peleato, and J. Eckstein, Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers, Foundations and Trends® in Machine Learning, vol.3, issue.1, pp.1-122, 2011.
DOI : 10.1561/2200000016

J. C. Duchi, A. Agarwal, and M. J. Wainwright, Dual Averaging for Distributed Optimization: Convergence Analysis and Network Scaling, IEEE Transactions on Automatic Control, vol.57, issue.3, pp.592-606, 2012.
DOI : 10.1109/TAC.2011.2161027
URL : http://arxiv.org/abs/1005.2012

O. Dekel, R. Gilad-Bachrach, O. Shamir, and L. Xiao, Optimal Distributed Online Prediction Using Mini-Batches, pp.165-202, 2012.

M. Balcan, S. Ehrlich, and Y. Liang, Distributed k-means and k-median clustering on general communication topologies, NIPS, 2013.

T. Yang, Trading Computation for Communication: Distributed Stochastic Dual Coordinate Ascent, NIPS, 2013.

R. Tibshirani, Regression shrinkage and selection via the lasso, J. R. Stat. Soc. B, vol.58, issue.1, pp.267-288, 1996.

C. Cortes and V. Vapnik, Support-vector networks, Machine Learning, pp.273-297, 1995.
DOI : 10.1007/BF00994018

F. R. Bach, G. R. Lanckriet, and M. I. Jordan, Multiple kernel learning, conic duality, and the SMO algorithm, Twenty-first international conference on Machine learning, ICML '04, 2004.
DOI : 10.1145/1015330.1015424
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.142.6040

C. Shen and H. Li, On the Dual Formulation of Boosting Algorithms, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.12, pp.2216-2231, 2010.
DOI : 10.1109/TPAMI.2010.47

H. Lee, A. Battle, R. Raina, and A. Y. Ng, Efficient sparse coding algorithms, NIPS, pp.801-808, 2006.

M. Frank and P. Wolfe, An algorithm for quadratic programming, Naval Research Logistics Quarterly, vol.3, issue.1-2, pp.95-110, 1956.
DOI : 10.1002/nav.3800030109

K. L. Clarkson, Coresets, sparse greedy approximation, and the Frank-Wolfe algorithm, ACM Transactions on Algorithms, vol.6, issue.4, pp.1-30, 2010.
DOI : 10.1145/1824777.1824783
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.145.9299

M. Jaggi, Revisiting Frank-Wolfe: Projection-Free Sparse Convex Optimization, ICML, 2013.

S. Shalev-Shwartz, N. Srebro, and T. Zhang, Trading Accuracy for Sparsity in Optimization Problems with Sparsity Constraints, SIAM Journal on Optimization, vol.20, issue.6, pp.2807-2832, 2010.
DOI : 10.1137/090759574

M. Jaggi, Sparse Convex Optimization Methods for Machine Learning, 2011.

S. Lacoste-Julien and M. Jaggi, An Affine Invariant Linear Convergence Analysis for Frank-Wolfe Algorithms, 2013.

A. Bellet, Y. Liang, A. Bagheri Garakani, M. Balcan, and F. Sha, A Distributed Frank-Wolfe Algorithm for Communication-Efficient Sparse Learning, 2014.
DOI : 10.1137/1.9781611974010.54
URL : https://hal.archives-ouvertes.fr/hal-01430851

T. F. Gonzalez, Clustering to minimize the maximum intercluster distance, Theoretical Computer Science, vol.38, pp.293-306, 1985.

R. M. Freund and P. Grigas, New Analysis and Results for the Conditional Gradient Method, 2013.

S. K. Shevade and S. S. Keerthi, A simple and efficient algorithm for gene selection using sparse logistic regression, Bioinformatics, vol.19, issue.17, pp.2246-2253, 2003.
DOI : 10.1093/bioinformatics/btg308
URL : http://bioinformatics.oxfordjournals.org/cgi/content/short/19/17/2246

M. Yuan and Y. Lin, Model selection and estimation in regression with grouped variables, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.68, issue.1, pp.49-67, 2006.
DOI : 10.1198/016214502753479356
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.366.4278

I. W. Tsang, J. T. Kwok, and P. Cheung, Core Vector Machines: Fast SVM Training on Very Large Data Sets, JMLR, vol.6, pp.363-392, 2005.

H. Ouyang and A. G. Gray, Fast Stochastic Frank-Wolfe Algorithms for Nonlinear SVMs, SDM, pp.245-256, 2010.
DOI : 10.1137/1.9781611972801.22
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.187.2952

S. Asharaf, M. N. Murty, and S. K. Shevade, Multiclass core vector machine, Proceedings of the 24th international conference on Machine learning, ICML '07, pp.41-48, 2007.
DOI : 10.1145/1273496.1273502
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.145.1650

S. Lacoste-Julien, M. Jaggi, M. Schmidt, and P. Pletscher, Block-Coordinate Frank-Wolfe Optimization for Structural SVMs, ICML, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00720158

D. Mosk-Aoyama, T. Roughgarden, and D. Shah, Fully Distributed Algorithms for Convex Optimization Problems, SIAM Journal on Optimization, vol.20, issue.6, pp.3260-3279, 2010.
DOI : 10.1137/080743706
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.122.2878

J. N. Tsitsiklis and Z. Luo, Communication complexity of convex optimization, Journal of Complexity, vol.3, issue.3, pp.231-243, 1987.
DOI : 10.1016/0885-064X(87)90013-6

E. Wei and A. E. Ozdaglar, Distributed Alternating Direction Method of Multipliers, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC), pp.5445-5450, 2012.
DOI : 10.1109/CDC.2012.6425904
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.294.8430