Using Data Compressors to Construct Rank Tests

Daniil Ryabko 1, * Juergen Schmidhuber 2
* Auteur correspondant
1 SEQUEL - Sequential Learning
LIFL - Laboratoire d'Informatique Fondamentale de Lille, LAGIS - Laboratoire d'Automatique, Génie Informatique et Signal, Inria Lille - Nord Europe
Abstract : Nonparametric rank tests for homogeneity and component independence are proposed, which are based on data compressors. For homogeneity testing the idea is to compress the binary string obtained by ordering the two joint samples and writing 0 if the element is from the first sample and 1 if it is from the second sample and breaking ties by randomization (extension to the case of multiple samples is straightforward). $H_0$ should be rejected if the string is compressed (to a certain degree) and accepted otherwise. We show that such a test obtained from an ideal data compressor is valid against all alternatives. Component independence is reduced to homogeneity testing by constructing two samples, one of which is the first half of the original and the other is the second half with one of the components randomly permuted.
Type de document :
Article dans une revue
Applied Mathematics Letters, Elsevier, 2009, 22 (7), pp.1029-1032
Liste complète des métadonnées

Littérature citée [9 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00269225
Contributeur : Daniil Ryabko <>
Soumis le : mercredi 2 avril 2008 - 14:57:26
Dernière modification le : vendredi 9 mars 2018 - 14:38:02
Document(s) archivé(s) le : jeudi 20 mai 2010 - 22:49:47

Fichiers

tests.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00269225, version 1

Collections

Citation

Daniil Ryabko, Juergen Schmidhuber. Using Data Compressors to Construct Rank Tests. Applied Mathematics Letters, Elsevier, 2009, 22 (7), pp.1029-1032. 〈inria-00269225〉

Partager

Métriques

Consultations de la notice

334

Téléchargements de fichiers

179