Sampling Online Social Networks: An Experimental Study of Twitter

Abstract : Online social networks (OSNs) are an important source of information for scientists in different fields such as computer science, sociology, economics, etc. However, it is hard to study OSNs as they are very large. For instance, Facebook has 1.28 billion active users in March 2014 and Twitter claims 255 million active users in April 2014. Also, com-panies take measures to prevent crawls of their OSNs and refrain from sharing their data with the research community. For these reasons, we argue that sampling techniques will be the best technique to study OSNs in the future. In this work, we take an experimental approach to study the characteristics of well-known sampling techniques on a full social graph of Twitter crawled in 2012 [2]. Our contri-bution is to evaluate the behavior of these techniques on a real directed graph by considering two sampling scenarios: (a) obtaining most popular users (b) obtaining an unbiased sample of users, and to find the most suitable sampling tech-niques for each scenario.
Document type :
Poster communications
Complete list of metadatas

Cited literature [3 references]  Display  Hide  Download

https://hal.inria.fr/hal-01096980
Contributor : Maksym Gabielkov <>
Submitted on : Thursday, December 18, 2014 - 3:38:25 PM
Last modification on : Thursday, January 11, 2018 - 4:22:44 PM
Long-term archiving on : Saturday, April 15, 2017 - 10:43:55 AM

Files

gabielkov_sampling.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Maksym Gabielkov, Ashwin Rao, Arnaud Legout. Sampling Online Social Networks: An Experimental Study of Twitter. ACM SIGCOMM 2014, Dec 2014, Chicago, IL, United States. ⟨10.1145/2619239.2631452⟩. ⟨hal-01096980⟩

Share

Metrics

Record views

364

Files downloads

536