Why Johnny Can't Browse in Peace: On the Uniqueness of Web Browsing History Patterns

Lukasz Olejnik 1 Claude Castelluccia 1 Artur Janc 2
1 PLANETE - Protocols and applications for the Internet
Inria Grenoble - Rhône-Alpes, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : We present the results of the first large-scale study of the uniqueness of Web browsing histories, gathered from a total of 368; 284 Internet users who visited a history detection demonstration website. Our results show that for a majority of users (69%), the browsing history is unique and that users for whom we could detect at least 4 visited websites were uniquely identified by their histories in 97% of cases. We observe a significant rate of stability in browser history fingerprints: for repeat visitors, 38% of fingerprints are identical over time, and differing ones were correlated with original history contents, indicating static browsing preferences (for history subvectors of size 50). We report a striking result that it is enough to test for a small number of pages in order to both enumerate users' interests and perform an efficient and unique behavioral fingerprint; we show that testing 50 web pages is enough to fingerprint 42% of users in our database, increasing to 70% with 500 web pages. Finally, we show that indirect history data, such as information about categories of visited websites can also be effective in fingerprinting users, and that similar fingerprinting can be performed by common script providers such as Google or Facebook.
Type de document :
Communication dans un congrès
5th Workshop on Hot Topics in Privacy Enhancing Technologies (HotPETs 2012), Jul 2012, Vigo, Spain. 2012
Liste complète des métadonnées

Littérature citée [22 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00747841
Contributeur : Claude Castelluccia <>
Soumis le : vendredi 2 novembre 2012 - 11:51:11
Dernière modification le : mercredi 11 avril 2018 - 01:53:01
Document(s) archivé(s) le : dimanche 3 février 2013 - 03:36:30

Fichier

johnny2hotpet-finalcam.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00747841, version 1

Collections

Citation

Lukasz Olejnik, Claude Castelluccia, Artur Janc. Why Johnny Can't Browse in Peace: On the Uniqueness of Web Browsing History Patterns. 5th Workshop on Hot Topics in Privacy Enhancing Technologies (HotPETs 2012), Jul 2012, Vigo, Spain. 2012. 〈hal-00747841〉

Partager

Métriques

Consultations de la notice

1138

Téléchargements de fichiers

912