Challenging the empirical mean and empirical variance: a deviation study - Inria - Institut national de recherche en sciences et technologies du numérique Access content directly
Preprints, Working Papers, ... Year : 2010

Challenging the empirical mean and empirical variance: a deviation study

Abstract

We present new M-estimators of the mean and variance of real valued random variables, based on PAC-Bayes bounds. We analyze the non-asymptotic minimax properties of the deviations of those estimators for sample distributions having either a bounded variance or a bounded variance and a bounded kurtosis. Under those weak hypotheses, allowing for heavy-tailed distributions, we show that the worst case deviations of the empirical mean are suboptimal. We prove indeed that for any confidence level, there is some M-estimator whose deviations are of the same order as the deviations of the empirical mean of a Gaussian statistical sample, even when the statistical sample is instead heavy-tailed. Experiments reveal that these new estimators perform even better than predicted by our bounds, showing deviation quantile functions uniformly lower at all probability levels than the empirical mean for non Gaussian sample distributions as simple as the mixture of two Gaussian measures.

Dates and versions

hal-00517206 , version 1 (13-09-2010)

Identifiers

Cite

Olivier Catoni. Challenging the empirical mean and empirical variance: a deviation study. 2010. ⟨hal-00517206⟩
222 View
0 Download

Altmetric

Share

Gmail Facebook X LinkedIn More