MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data Splitting - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2023

MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data Splitting

Résumé

We propose novel statistics which maximise the power of a two-sample test based on the Maximum Mean Discrepancy (MMD), by adapting over the set of kernels used in defining it. For finite sets, this reduces to combining (normalised) MMD values under each of these kernels via a weighted soft maximum. Exponential concentration bounds are proved for our proposed statistics under the null and alternative. We further show how these kernels can be chosen in a data-dependent but permutation-independent way, in a wellcalibrated test, avoiding data splitting. This technique applies more broadly to general permutation-based MMD testing, and includes the use of deep kernels with features learnt using unsupervised models such as auto-encoders. We highlight the applicability of our MMD-FUSE test on both synthetic lowdimensional and real-world high-dimensional data, and compare its performance in terms of power against current state-of-the-art kernel tests.
Fichier principal
Vignette du fichier
2306.08777.pdf (2.68 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-04156329 , version 1 (08-07-2023)
hal-04156329 , version 2 (08-11-2023)

Licence

Paternité - Pas d'utilisation commerciale - Partage selon les Conditions Initiales

Identifiants

Citer

Felix Biggs, Antonin Schrab, Arthur Gretton. MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data Splitting. 2023. ⟨hal-04156329v2⟩
36 Consultations
60 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More