A low variance consistent test of relative dependency - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2014

A low variance consistent test of relative dependency

Résumé

We describe a novel non parametric statistical hypothesis test of relative dependence between a source variable and two candidate target variables. Such a test enables one to answer whether one source variable is significantly more dependent on the first target variable or the second. Dependence is measured via the Hilbert-Schmidt Independence Criterion (HSIC), resulting in a pair of empirical dependence measures (source-target 1, source-target 2). Modeling the covariance between these HSIC statistics leads to a provably more powerful test than the construction of independent HSIC statistics by sub sampling. The resulting test is consistent and unbiased, and (being based on U-statistics) has favorable convergence properties. The test can be computed in quadratic time, matching the computational complexity of standard empirical HSIC estimators. We demonstrate the effectiveness of the test on a real-world linguistics problem of identifying language groups from a multilingual corpus.

Domaines

Calcul [stat.CO]
Fichier principal
Vignette du fichier
reldep.pdf (1.33 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-01005828 , version 1 (13-06-2014)
hal-01005828 , version 2 (13-06-2014)
hal-01005828 , version 3 (11-07-2014)
hal-01005828 , version 4 (20-05-2015)

Identifiants

  • HAL Id : hal-01005828 , version 1

Citer

Wacha Bounliphone, Arthur Gretton, Matthew Blaschko. A low variance consistent test of relative dependency. 2014. ⟨hal-01005828v1⟩
849 Consultations
650 Téléchargements

Partager

Gmail Facebook X LinkedIn More