HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

Bootstrap Your Own Latent: A new approach to self-supervised learning

Abstract : We investigate and provide new insights on the sampling rule called Top-Two Thompson Sampling (TTTS). In particular, we justify its use for fixed-confidence best-arm identification. We further propose a variant of TTTS called Top-Two Transportation Cost (T3C), which disposes of the computational burden of TTTS. As our main contribution, we provide the first sample complexity analysis of TTTS and T3C when coupled with a very natural Bayesian stopping rule, for bandits with Gaussian rewards, solving one of the open questions raised by Russo (2016). We also provide new posterior convergence results for TTTS under two models that are commonly used in practice: bandits with Gaussian and Bernoulli rewards and conjugate priors.
Document type :
Conference papers
Complete list of metadata

Contributor : Michal Valko Connect in order to contact the contributor
Submitted on : Thursday, February 25, 2021 - 4:52:13 PM
Last modification on : Tuesday, February 15, 2022 - 11:02:04 AM


Files produced by the author(s)


  • HAL Id : hal-02869787, version 2
  • ARXIV : 1910.10945


Jean-Bastien Grill, Florian Strub, Florent Altché, Corentin Tallec, Pierre Richemond, et al.. Bootstrap Your Own Latent: A new approach to self-supervised learning. Neural Information Processing Systems, 2020, Montréal, Canada. ⟨hal-02869787v2⟩



Record views


Files downloads