N. Akakpo, Adaptation to anisotropy and inhomogeneity via dyadic piecewise polynomial selection, Mathematical Methods of Statistics, vol.21, issue.1, 2012.
DOI : 10.3103/S1066530712010012

URL : https://hal.archives-ouvertes.fr/hal-00565918

N. Akakpo and C. Lacour, Inhomogeneous and anisotropic conditional density estimation from dependent data, Electronic Journal of Statistics, vol.5, 2011.
DOI : 10.1214/11-EJS653

URL : https://hal.archives-ouvertes.fr/hal-00557307

A. Antoniadis, J. Bigot, and R. von Sachs, A Multiscale Approach for Statistical Characterization of Functional Images, Journal of Computational and Graphical Statistics, vol.18, issue.1, pp.216-237, 2009.
DOI : 10.1198/jcgs.2009.0013

URL : https://hal.archives-ouvertes.fr/hal-00627384

A. Barron, C. Huang, J. Li, and X. Luo, MDL Principle, Penalized Likelihood, and Statistical Risk, chapter in Festschrift in Honor of Jorma Rissanen on the Occasion of his 75th Birthday, 2008.

D. Bashtannyk and R. Hyndman, Bandwidth selection for kernel conditional density estimation, Computational Statistics & Data Analysis, vol.36, issue.3, pp.279-298, 2001.
DOI : 10.1016/S0167-9473(00)00046-3

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.27.2316

L. Bertrand, M. Languille, S. Cohen, L. Robinet, C. Gervais et al., European research platform IPANEMA at the SOLEIL synchrotron for ancient and historical materials, Journal of Synchrotron Radiation, vol.18, issue.5, pp.765-772, 2011.
DOI : 10.1107/S090904951102334X

URL : https://hal.archives-ouvertes.fr/hal-00618143

C. Biernacki, G. Celeux, G. Govaert, and F. Langrognet, Model-based cluster and discriminant analysis with the MIXMOD software, Computational Statistics & Data Analysis, vol.51, issue.2, pp.587-600, 2006.

L. Birgé, Model selection for Gaussian regression with random design, Bernoulli, vol.10, issue.6, pp.1039-1051, 2004.
DOI : 10.3150/bj/1106314849

L. Birgé and P. Massart, Minimum Contrast Estimators on Sieves: Exponential Bounds and Rates of Convergence, Bernoulli, vol.4, issue.3, pp.329-375, 1998.
DOI : 10.2307/3318720

L. Birgé and P. Massart, Minimal penalties for Gaussian model selection, Probability Theory and Related Fields, pp.33-73, 2007.

G. Blanchard, C. Schäfer, Y. Rozenholc, and K. R. Müller, Optimal dyadic decision trees, Machine Learning, pp.209-241, 2007.
DOI : 10.1007/s10994-007-0717-6

URL : https://hal.archives-ouvertes.fr/hal-00264988

S. Boucheron and P. Massart, A high-dimensional Wilks phenomenon, Probability Theory and Related Fields, pp.1-29, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00945509

E. Brunel, F. Comte, and C. Lacour, Adaptive estimation of the conditional density in presence of censoring, Sankhyā, pp.734-763, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00152794

O. Catoni, PAC-Bayesian Supervised Classification: The Thermodynamics of Statistical Learning, Lecture Notes-Monograph Series, Institute of Mathematical Statistics, vol.56, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00206119

S. Cohen and E. Le Pennec, Conditional density estimation by penalized likelihood model selection and applications, 1103.
URL : https://hal.archives-ouvertes.fr/inria-00575462

S. Cohen and E. Le Pennec, Unsupervised segmentation of hyperspectral images with spatialized Gaussian mixture model and model selection, 2012.

J. De Gooijer and D. Zerom, On Conditional Density Estimation, Statistica Neerlandica, vol.57, issue.2, pp.159-176, 2003.
DOI : 10.1017/S0266466602181096

D. Donoho, CART and best-ortho-basis: a connection, The Annals of Statistics, vol.25, issue.5, pp.1870-1911, 1997.
DOI : 10.1214/aos/1069362377

S. Efromovich, Conditional density estimation in a regression setting, The Annals of Statistics, vol.35, issue.6, pp.2504-2535, 2007.
DOI : 10.1214/009053607000000253

S. Efromovich, Oracle inequality for conditional density estimation and an actuarial example, Annals of the Institute of Statistical Mathematics, vol.62, issue.2, pp.249-275, 2010.
DOI : 10.1007/s10463-008-0185-1

J. Fan and T. Yim, A crossvalidation method for estimating conditional densities, Biometrika, vol.91, issue.4, pp.819-834, 2004.
DOI : 10.1093/biomet/91.4.819

J. Fan, Q. Yao, and H. Tong, Estimation of conditional densities and sensitivity measures in nonlinear dynamical systems, Biometrika, vol.83, issue.1, pp.189-206, 1996.
DOI : 10.1093/biomet/83.1.189

J. Fan, C. Zhang, and J. Zhang, Generalized Likelihood Ratio Statistics and Wilks Phenomenon, The Annals of Statistics, vol.29, issue.1, pp.153-193, 2001.
DOI : 10.1214/aos/996986505

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.42.6699

C. Genovese and L. Wasserman, Rates of convergence for the Gaussian mixture sieve, The Annals of Statistics, vol.28, issue.4, pp.1105-1127, 2000.

L. Györfi and M. Kohler, Nonparametric Estimation of Conditional Distributions, IEEE Transactions on Information Theory, vol.53, issue.5, pp.1872-1879, 2007.
DOI : 10.1109/TIT.2007.894631

P. Hall, R. Wolff, and Q. Yao, Methods for Estimating a Conditional Distribution Function, Journal of the American Statistical Association, vol.94, issue.445, pp.154-163, 1999.
DOI : 10.1080/01621459.1998.10474104

P. Hall, J. Racine, and Q. Li, Cross-Validation and the Estimation of Conditional Probability Densities, Journal of the American Statistical Association, vol.99, issue.468, pp.1015-1026, 2004.
DOI : 10.1198/016214504000000548

T. Hofmann, Probabilistic latent semantic analysis, Proc. of Uncertainty in Artificial Intelligence, 1999.

Y. Huang, I. Pollak, M. Do, and C. Bouman, Fast search for best representations in multitree dictionaries, IEEE Transactions on Image Processing, vol.15, issue.7, pp.1779-1793, 2006.
DOI : 10.1109/TIP.2006.873465

R. Hyndman and Q. Yao, Nonparametric Estimation and Symmetry Tests for Conditional Density Functions, Journal of Nonparametric Statistics, vol.14, issue.3, pp.259-278, 2002.
DOI : 10.1080/10485250212374

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.196.6614

R. Hyndman, D. Bashtannyk, and G. Grunwald, Estimating and visualizing conditional densities, Journal of Computational and Graphical Statistics, vol.5, pp.315-336, 1996.
DOI : 10.1080/10618600.1996.10474715

B. Karaivanov and P. Petrushev, Nonlinear piecewise polynomial approximation beyond Besov spaces, Applied and Computational Harmonic Analysis, vol.15, issue.3, pp.177-223, 2003.
DOI : 10.1016/j.acha.2003.08.002

URL : http://doi.org/10.1016/j.acha.2003.08.002

E. Kolaczyk and R. Nowak, Multiscale generalised linear models for nonparametric function estimation, Biometrika, vol.92, issue.1, pp.119-133, 2005.
DOI : 10.1093/biomet/92.1.119

E. Kolaczyk, J. Ju, and S. Gopal, Multiscale, Multigranular Statistical Image Segmentation, Journal of the American Statistical Association, vol.100, issue.472, pp.1358-1369, 2005.
DOI : 10.1198/016214505000000385

M. Kosorok, Introduction to Empirical Processes and Semiparametric Inference, 2008.
DOI : 10.1007/978-0-387-74978-5

Q. Li and J. Racine, Nonparametric Econometrics: Theory and Practice, 2007.

J. Lin, Divergence measures based on the Shannon entropy, IEEE Transactions on Information Theory, vol.37, issue.1, pp.145-151, 1991.

P. Massart, Concentration inequalities and model selection: Lectures from the 33rd Summer School on Probability Theory held in Saint-Flour, Lecture Notes in Mathematics, vol.1896, 2003.

C. Maugis and B. Michel, A non asymptotic penalized criterion for Gaussian mixture model selection, ESAIM: Probability and Statistics, vol.15, 2010.
DOI : 10.1051/ps/2009004

URL : https://hal.archives-ouvertes.fr/inria-00284613

C. Maugis and B. Michel, Erratum on "A non asymptotic penalized criterion for Gaussian mixture model selection", Available on their webpage, 2010.

C. Maugis and B. Michel, Data-driven penalty calibration: A case study for Gaussian mixture model selection, ESAIM: Probability and Statistics, vol.15, 2011.
DOI : 10.1051/ps/2010002

URL : https://hal.archives-ouvertes.fr/hal-00666813

M. Rosenblatt, Conditional probability density and regression estimators, Multivariate Analysis, II (Proc. Second Internat. Sympos.), pp.25-31, 1968.

L. Si and R. Jin, Adjusting Mixture Weights of Gaussian Mixture Model via Regularized Probabilistic Latent Semantic Analysis, Advances in Knowledge Discovery and Data Mining, pp.218-252, 2005.
DOI : 10.1007/11430919_72

C. Stone, The use of polynomial splines and their tensor products in multivariate function estimation, The Annals of Statistics, vol.22, issue.1, pp.118-171, 1994.

S. Szarek, Metric entropy of homogeneous spaces, pp.395-410, 1997.

S. van de Geer, The method of sieves and minimum contrast estimators, Mathematical Methods of Statistics, vol.4, pp.20-38, 1995.

A. van der Vaart and J. Wellner, Weak Convergence, 1996.
DOI : 10.1007/978-1-4757-2545-2_3

I. Van Keilegom and N. Veraverbeke, Density and hazard estimation in censored regression models, Bernoulli, vol.8, issue.5, pp.607-625, 2002.

H. White, Maximum Likelihood Estimation of Misspecified Models, Econometrica, vol.50, issue.1, pp.1-25, 1982.
DOI : 10.2307/1912526

S. Wilks, The Large-Sample Distribution of the Likelihood Ratio for Testing Composite Hypotheses, The Annals of Mathematical Statistics, vol.9, issue.1, pp.60-62, 1938.
DOI : 10.1214/aoms/1177732360

R. Willett and R. Nowak, Multiscale Poisson Intensity and Density Estimation, IEEE Transactions on Information Theory, vol.53, issue.9, pp.3171-3187, 2007.
DOI : 10.1109/TIT.2007.903139

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.119.2643

D. Young and D. Hunter, Mixtures of regressions with predictor-dependent mixing proportions, Computational Statistics & Data Analysis, vol.54, issue.10, pp.2253-2266, 2010.
DOI : 10.1016/j.csda.2010.04.002