Skip to Main content Skip to Navigation
New interface
Journal articles

Variance reduction in purely random forests

Abstract : Random forests, introduced by Leo Breiman in 2001, are a very effective statistical method. The complex mechanism of the method makes theoretical analysis difficult. Therefore, simplified versions of random forests, called purely random forests, which can be theoretically handled more easily, have been considered. In this paper we study the variance of such forests. First, we show a general upper bound which emphasizes the fact that a forest reduces the variance. We then introduce a simple variant of purely random forests, that we call purely uniformly random forests. For this variant and in the context of regression problems with a one-dimensional predictor space, we show that both random trees and random forests reach minimax rate of convergence. In addition, we prove that compared to random trees, random forests improve accuracy by reducing the estimator variance by a factor of three fourths.
Document type :
Journal articles
Complete list of metadata

Cited literature [10 references]  Display  Hide  Download
Contributor : Robin Genuer Connect in order to contact the contributor
Submitted on : Tuesday, September 19, 2017 - 4:34:24 PM
Last modification on : Wednesday, March 24, 2021 - 3:31:14 AM


Files produced by the author(s)


Distributed under a Creative Commons Attribution 4.0 International License




Robin Genuer. Variance reduction in purely random forests. Journal of Nonparametric Statistics, 2012, 2, pp.18 - 562. ⟨10.1007/978-1-4899-0027-2⟩. ⟨hal-01590513⟩



Record views


Files downloads