Distribution and Dependence of Extremes in Network Sampling Processes

Abstract : We explore the dependence structure in the sampled sequence of large networks. We consider randomized algorithms to sample the nodes and study extremal properties in any associated stationary sequence of characteristics of interest like node degrees, number of followers or income of the nodes in Online Social Networks etc, which satisfy two mixing conditions. Several useful extremes of the sampled sequence like $k$th largest value, clusters of exceedances over a threshold, first hitting time of a large value etc are investigated. We abstract the dependence and the statistics of extremes into a single parameter that appears in Extreme Value Theory, called extremal index (EI). In this work, we derive this parameter analytically and also estimate it empirically. We propose the use of EI as a parameter to compare different sampling procedures. As a specific example, degree correlations between neighboring nodes are studied in detail with three prominent random walks as sampling techniques.
Type de document :
Rapport
[Research Report] RR-8578, Inria. 2014, pp.25
Liste complète des métadonnées

Littérature citée [21 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01054929
Contributeur : Jithin Sreedharan <>
Soumis le : mardi 24 février 2015 - 10:14:05
Dernière modification le : samedi 27 janvier 2018 - 01:31:43
Document(s) archivé(s) le : lundi 25 mai 2015 - 10:26:06

Fichiers

RR-8578.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01054929, version 3
  • ARXIV : 1408.2529

Collections

Citation

Konstantin Avrachenkov, Natalia M. Markovich, Jithin K. Sreedharan. Distribution and Dependence of Extremes in Network Sampling Processes. [Research Report] RR-8578, Inria. 2014, pp.25. 〈hal-01054929v3〉

Partager

Métriques

Consultations de la notice

126

Téléchargements de fichiers

112