
hal-00262478, version 2

Model selection by resampling penalization

Sylvain Arlot (1, 2)

Electronic Journal of Statistics 3 (2009) 557--624

Abstract: In this paper, a new family of resampling-based penalization procedures for model selection is defined in a general framework. It generalizes several methods, including Efron's bootstrap penalization and the leave-one-out penalization recently proposed by Arlot (2008), to any exchangeable weighted bootstrap resampling scheme. In the heteroscedastic regression framework, assuming the models to have a particular structure, these resampling penalties are proved to satisfy a non-asymptotic oracle inequality with leading constant close to 1. In particular, they are asymptotically optimal. Resampling penalties are used for defining an estimator adapting simultaneously to the smoothness of the regression function and to the heteroscedasticity of the noise. This is remarkable because resampling penalties are general-purpose devices, which have not been built specifically to handle heteroscedastic data. Hence, resampling penalties naturally adapt to heteroscedasticity. A simulation study shows that resampling penalties improve on V-fold cross-validation in terms of final prediction error, in particular when the signal-to-noise ratio is not large.
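
Illustration (not taken from the paper): the abstract's idea can be sketched for regressogram (histogram) selection with Efron bootstrap weights. This is a minimal sketch under stated assumptions, not the paper's exact procedure. The function names, the design on [0, 1), regular bins, the multiplicative constant const = 1, the Monte Carlo approximation of the expectation over weights, and the treatment of bins that receive zero resampling weight are all assumptions made for this example.

```python
import numpy as np

def regressogram_fit(x, y, w, n_bins, fallback=None):
    """Weighted least-squares histogram regressor on [0, 1) with regular bins.

    Returns the per-bin weighted means and the bin index of each point.
    Bins with zero total weight take the value in `fallback` (zeros if None).
    """
    bins = np.minimum((x * n_bins).astype(int), n_bins - 1)
    sw = np.bincount(bins, weights=w, minlength=n_bins)
    swy = np.bincount(bins, weights=w * y, minlength=n_bins)
    out = np.zeros(n_bins) if fallback is None else np.array(fallback, dtype=float)
    means = np.divide(swy, sw, out=out, where=sw > 0)
    return means, bins

def resampling_penalty(x, y, n_bins, n_resamples=200, const=1.0, seed=None):
    """Monte Carlo estimate of const * E_W[P_n g(s_W) - P_n^W g(s_W)].

    s_W is the histogram estimator fitted with resampling weights W, g is the
    squared loss, P_n the empirical mean, P_n^W the W-weighted empirical mean.
    """
    rng = np.random.default_rng(seed)
    n = len(y)
    # Assumption: bins emptied by resampling fall back to the full-sample bin mean.
    full_means, _ = regressogram_fit(x, y, np.ones(n), n_bins)
    gaps = []
    for _ in range(n_resamples):
        # Efron bootstrap weights: multinomial counts summing to n.
        w = rng.multinomial(n, np.full(n, 1.0 / n)).astype(float)
        means, bins = regressogram_fit(x, y, w, n_bins, fallback=full_means)
        resid2 = (y - means[bins]) ** 2
        gaps.append(resid2.mean() - (w * resid2).mean())
    return const * float(np.mean(gaps))

def select_model(x, y, candidate_bins, **kwargs):
    """Pick the bin count minimizing empirical risk plus resampling penalty."""
    crit = {}
    for d in candidate_bins:
        means, bins = regressogram_fit(x, y, np.ones(len(y)), d)
        emp_risk = float(np.mean((y - means[bins]) ** 2))
        crit[d] = emp_risk + resampling_penalty(x, y, d, **kwargs)
    return min(crit, key=crit.get), crit
```

Calling select_model(x, y, [1, 2, 5, 10, 20]) on a sample with x in [0, 1) returns the selected number of bins together with the penalized criterion for each candidate. Other exchangeable weight schemes could be tried by replacing the multinomial draw, with const adjusted accordingly.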

  • 1:  Laboratoire d'informatique de l'école normale supérieure (LIENS)
  • CNRS : UMR8548 – Ecole normale supérieure de Paris - ENS Paris
  • 2:  WILLOW (INRIA Rocquencourt)
  • INRIA – Ecole normale supérieure de Paris - ENS Paris – Ecole des Ponts ParisTech – CNRS : UMR8548
  • Domain : Mathematics/Statistics
    Statistics/Statistics Theory
  • Keywords : non-parametric statistics – resampling – exchangeable weighted bootstrap – model selection – penalization – non-parametric regression – adaptivity – heteroscedastic data – regressogram – histogram selection
  • Comment : extended version of http://hal.archives-ouvertes.fr/hal-00125455 – with a technical appendix
  • Available versions :  v1 (2008-03-11) v2 (2009-06-17)
 
  • oai:hal.archives-ouvertes.fr:hal-00262478
  • Submitted on: Wednesday, 17 June 2009 11:28:29
  • Updated on: Friday, 19 June 2009 11:55:57