Finite-Time Analysis of Stratified Sampling for Monte Carlo

Alexandra Carpentier; Rémi Munos

Communication Dans Un Congrès Année : 2011

Finite-Time Analysis of Stratified Sampling for Monte Carlo

(1) , (1)

Alexandra Carpentier

Fonction : Auteur
PersonId : 910455

Sequential Learning

Rémi Munos

Fonction : Auteur
PersonId : 836863

Sequential Learning

Résumé

We consider the problem of stratified sampling for Monte-Carlo integration. We model this problem in a multi-armed bandit setting, where the arms represent the strata, and the goal is to estimate a weighted average of the mean values of the arms. We propose a strategy that samples the arms according to an upper bound on their standard deviations and compare its estimation quality to an ideal allocation that would know the standard deviations of the strata. We provide two regret analyses: a distribution-dependent bound $\widetilde O(n^{-3/2})$ that depends on a measure of the disparity of the strata, and a distribution-free bound $\widetilde O(n^{-4/3})$ that does not.

Domaines

Autres [stat.ML]

Fichier principal

mc-ucb_3.pdf (319.56 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Alexandra Carpentier : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00636924

Soumis le : lundi 27 février 2012-17:46:09

Dernière modification le : vendredi 24 mars 2023-14:52:55

Archivage à long terme le : jeudi 14 juin 2012-17:00:24

Dates et versions

inria-00636924 , version 1 (28-10-2011)

inria-00636924 , version 2 (13-01-2012)

inria-00636924 , version 3 (27-02-2012)

Identifiants

HAL Id : inria-00636924 , version 3

Citer

Alexandra Carpentier, Rémi Munos. Finite-Time Analysis of Stratified Sampling for Monte Carlo. NIPS - Twenty-Fifth Annual Conference on Neural Information Processing Systems, Dec 2011, Grenade, Spain. ⟨inria-00636924v3⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LILLE3 CNRS INRIA LAGIS INRIA2

235 Consultations

248 Téléchargements

Finite-Time Analysis of Stratified Sampling for Monte Carlo

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager