F. Abramovich and T. Lahav, Sparse additive regression on a regular lattice, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.68, issue.2, pp.443-459, 2015.
DOI : 10.1111/rssb.12075

G. I. Allen, L. Grosenick, and J. Taylor, A Generalized Least-Square Matrix Decomposition, Journal of the American Statistical Association, vol.17, issue.505, pp.145-159, 2014.
DOI : 10.1214/009053607000000127

. Alquier, Bayesian Methods for Low-Rank Matrix Estimation: Short Survey and Theoretical Study, Algorithmic Learning Theory 2013, pp.309-323, 2013.
DOI : 10.1007/978-3-642-40935-6_22

V. Alquier, N. Cottet, J. Chopin, and . Rousseau, Bayesian matrix completion: prior specification, p.26, 2014.

J. Alquier, N. Ridgway, and . Chopin, On the properties of variational approximations of Gibbs posteriors, 2015.

D. P. Bertsekas, Nonlinear programming, Athena Scientific, issue.8, 1999.

M. Bishop, Pattern Recognition and Machine Learning, chapter 10, p.9, 2006.

C. Bissiri, S. Holmes, and . Walker, A general framework for updating belief distributions, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.52, issue.5, 2013.
DOI : 10.1111/rssb.12158

V. Bittorf, B. Recht, C. Re, and J. Tropp, Factoring nonnegative matrices with linear programs, Advances in Neural Information Processing Systems, pp.1214-1222, 2012.

S. Boyd, N. Parikh, E. Chu, B. Peleato, and J. Eckstein, Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers, Machine Learning, pp.1-122, 2011.
DOI : 10.1561/2200000016

. Catoni, A PAC-Bayesian approach to adaptive classification, 2003.

. Catoni, Statistical Learning Theory and Stochastic Optimization. Saint- Flour Summer School on Probability Theory, Lecture Notes in Mathematics, 2001.
URL : https://hal.archives-ouvertes.fr/hal-00104952

. Catoni and . Pac-, Bayesian supervised classification: the thermodynamics of statistical learning, Institute of Mathematical Statistics Lecture Notes? Monograph Series, vol.56, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00206119

J. Corander and M. Villani, Bayesian assessment of dimensionality in reduced rank regression, Statistica Neerlandica, vol.36, issue.3, pp.255-270, 2004.
DOI : 10.1111/1467-9892.00212

A. Dalalyan and A. B. Tsybakov, Aggregation by exponential weighting, sharp PAC-Bayesian bounds and sparsity, Machine Learning, pp.39-61, 2008.
DOI : 10.1007/s10994-008-5051-0

URL : https://hal.archives-ouvertes.fr/hal-00291504

L. Cun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard et al., Handwritten digit recognition with a back-propagation network, Advances in neural information processing systems. Citeseer, p.13, 1990.

D. D. Lee and H. S. Seung, Learning the parts of objects by non-negative matrix factorization, Nature, vol.401, issue.6755, pp.788-791, 1999.

D. D. Lee and H. S. Seung, Algorithms for non-negative matrix factorization, Advances in neural information processing systems, pp.556-562, 2001.

G. Leung and A. R. Barron, Information Theory and Mixing Least-Squares Regressions, IEEE Transactions on Information Theory, vol.52, issue.8, pp.3396-3410, 2006.
DOI : 10.1109/TIT.2006.878172

B. Li, S. Guedj, and . Loustau, PAC-Bayesian online clustering. arXiv preprint, p.19, 2016.

Y. J. Lim and Y. W. Teh, Variational Bayesian approach to movie rating prediction, Proceedings of KDD Cup and Workshop, pp.15-21, 2007.

C. Lin, Projected Gradient Methods for Nonnegative Matrix Factorization, Neural Computation, vol.5, issue.10, pp.2756-2779, 2007.
DOI : 10.1007/BF01584660

D. J. Mackay, Information Theory, Inference and Learning Algorithms, 2002.

T. T. Mai and P. Alquier, A Bayesian approach for noisy matrix completion: Optimal rate under general sampling distribution, Electronic Journal of Statistics, vol.9, issue.1, pp.823-841, 2015.
DOI : 10.1214/15-EJS1020

D. Mcallester, Some PAC-Bayesian theorems, Proceedings of the eleventh annual conference on Computational learning theory , COLT' 98, pp.230-234, 1998.
DOI : 10.1145/279943.279989

D. Moussaoui, A. Brie, C. Mohammad-djafari, and . Carteret, Separation of Non-Negative Mixture of Non-Negative Sources Using a Bayesian Approach and MCMC Sampling, IEEE Transactions on Signal Processing, vol.54, issue.11, pp.4133-4145, 2006.
DOI : 10.1109/TSP.2006.880310

URL : https://hal.archives-ouvertes.fr/hal-00121602

A. Ozerov and C. Févotte, Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.550-563, 2010.
DOI : 10.1109/TASL.2009.2031510