Data Harvesting 2.0: from the Visible to the Invisible Web

Claude Castelluccia 1, * Stéphane Grumbach 2, * Lukasz Olejnik 1
* Auteur correspondant
1 PRIVATICS - Privacy Models, Architectures and Tools for the Information Society
Inria Grenoble - Rhône-Alpes, CITI - CITI Centre of Innovation in Telecommunications and Integration of services
2 DICE - Data on the Internet at the Core of the Economy
Inria Grenoble - Rhône-Alpes, INSA - Institut National des Sciences Appliquées
Abstract : Personal data are fuelling a fast emerging industry which transform them into added value. Harvesting these data is therefore of the outermost importance for the economy. In this paper, we study the flows of personal data at a global level, and distinguish countries based on their capacity to harvest data. We establish a cartography of international data channels on the visible and invisible Web. The visible Web is composed of the sites that are available to the general public and are typically indexed by search engines. The invisible Web refers to tags, Web bugs, pixels and beacons that appear on Websites to track and profile users. It is well known that the US dominate the visible Web with more than 70% of the top 100 sites in the world. We show that this domination is even stronger on the invisible Web.The largest proportion of trackers in most countries are indeed from the US. Apart from the US, two countries exhibit an original strategy. China, which dominates its visible Web with a majority of local sites, but surprisingly these sites still contain a majority of US trackers. Russia, which also dominates its visible Web, and is the only country with more local trackers than US ones.
keyword : trackers
Type de document :
Communication dans un congrès
The Twelfth Workshop on the Economics of Information Security, Jun 2013, Washington, DC, United States. 2013
Domaine :
Liste complète des métadonnées


https://hal.inria.fr/hal-00832784
Contributeur : Stephane Grumbach <>
Soumis le : mardi 11 juin 2013 - 14:37:51
Dernière modification le : mercredi 18 novembre 2015 - 01:11:24
Document(s) archivé(s) le : jeudi 12 septembre 2013 - 04:07:54

Fichier

WEIS13-final.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00832784, version 1

Collections

Citation

Claude Castelluccia, Stéphane Grumbach, Lukasz Olejnik. Data Harvesting 2.0: from the Visible to the Invisible Web. The Twelfth Workshop on the Economics of Information Security, Jun 2013, Washington, DC, United States. 2013. <hal-00832784>

Partager

Métriques

Consultations de
la notice

485

Téléchargements du document

661