, How to prevent getting blacklisted while scraping, p.29, 2019.
The Web Never Forgets: Persistent Tracking Mechanisms in the Wild, Proceedings of the 2014 ACM SIGSAC Conference on Computer and Communications Security (CCS '14), pp.674-689, 2014. ,
FPDetective: dusting the web for fingerprinters, Proceedings of the, pp.1129-1140, 2013. ,
, Alexa: an Amazon.com company. 2019. Alexa: the top sites on the web, p.29, 2019.
Precise Detection of Content Reuse in the Web, SIGCOMM Comput. Commun. Rev, vol.49, issue.2, pp.9-24, 2019. ,
Crawling a Country: Better Strategies Than Breadth-first for Web Page Ordering, Special Interest Tracks and Posters of the 14th International Conference on World Wide Web (WWW '05), pp.864-872, 2005. ,
Web Scraping and Crawling Are Perfectly Legal, 2018. ,
Security Analysis of Subject Access Request Procedures, Privacy Technologies and Policy, pp.182-209, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02072302
Graph structure in the Web, Computer Networks, vol.33, pp.309-320, 2000. ,
Scheduling algorithms for Web crawling, WebMedia and LA-Web, pp.10-17, 2004. ,
, SeleniumHQ Browser Automation, 2019.
, World Wide Web Consortium, W3C Webdriver Standard, 2019.
The Web's Sixth Sense: A Study of Scripts Accessing Smartphone Sensors, Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security -CCS '18, pp.1515-1532, 2018. ,
Disconnect Tracking Protection List, 2019. ,
Web robot detection techniques: overview and limitations, Data Mining and Knowledge Discovery, vol.22, pp.183-210, 2011. ,
How Unique Is Your Web Browser, Privacy Enhancing Technologies, pp.1-18, 2010. ,
Online Tracking: A 1-million-site Measurement and Analysis, Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security (CCS '16), pp.1388-1401, 2016. ,
, Michalis Faloutsos, Petros Faloutsos, and Christos Faloutsos. 1999. On Power-law Relationships of the Internet Topology. SIGCOMM Comput, vol.29, pp.251-262, 1999.
SpeedReader: Reader Mode Made Fast and Private, The World Wide Web Conference (WWW '19), pp.526-537, 2019. ,
, , 2019.
Learning Word Vectors for 157 Languages, Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Nicoletta Calzolari (Conference chair), 2018. ,
An Automated Approach for Complementing Ad Blockers' Blacklists, Proceedings on Privacy Enhancing Technologies, vol.2, pp.282-298, 2015. ,
Analysis and Simulation of Computer Telecommunication Systems (MASCOTS, Proceedings of the 11th IEEE/ACM International Symposium on Modeling, pp.16-25, 2003. ,
Like a Pack of Wolves: Community Structure of Web Trackers, Passive and Active Measurement, pp.42-54, 2016. ,
A survey of Web crawlers for information retrieval, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol.7, issue.6, p.1218, 2017. ,
Beauty and the Beast: Diverting Modern Web Browsers to Build Unique Browser Fingerprints, 2016 IEEE Symposium on Security and Privacy (SP, pp.878-894, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01285470
, Tranco: A Research-Oriented Top Sites Ranking Hardened Against Manipulation, 2018.
Dark Patterns at Scale: Findings from a Crawl of 11K Shopping Websites, pp.1-32, 2019. ,
The Price of Free: Privacy Leakage in Personalized Mobile In-App Ads, Proceedings 2016 Network and Distributed System Security Symposium. Internet Society, 2016. ,
Block me if you can: A large-scale study of tracker-blocking tools, 2017 IEEE European Symposium on Security and Privacy (EuroS&P), pp.319-333, 2017. ,
Why Is the Shape of the Web a Bowtie, Proceedings of the 2012 World Wide Web conference, WebScience Track (WWW '12), vol.3, 2012. ,
, The Graph Structure in the Web -Analyzed on Different Aggregation Levels, vol.1, pp.33-47, 2015.
Measurement and Analysis of Online Social Networks, Proceedings of the 7th ACM SIGCOMM Conference on Internet Measurement (IMC '07), pp.29-42, 2007. ,
Watching You Watch: The Tracking Ecosystem of Over-the-Top TV Streaming Devices, Proceedings of the 2019 ACM SIGSAC Conference on Computer and Communications Security (CCS '19), pp.131-147, 2019. ,
Firefox Now Available with Enhanced Tracking Protection by Default Plus Updates to Facebook Container, Firefox Monitor and Lockwise, p.14, 2019. ,
JESTr Pioneer Shield Study, 2019. ,
, Mozilla Privacy Policy, 2019.
Security/Anti tracking policy, p.29, 2019. ,
Study Companion Repository, 2019. ,
Cookieless Monster: Exploring the Ecosystem of Web-Based Device Fingerprinting, 2013 IEEE Symposium on Security and Privacy, pp.541-555, 2013. ,
Why Johnny Can't Browse in Peace: On the Uniqueness of Web Browsing History Patterns, 5th Workshop on Hot Topics in Privacy Enhancing Technologies, pp.1-17, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00747841
How Unique is Your .Onion?: An Analysis of the Fingerprintability of Tor Onion Services, Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security (CCS '17), pp.2021-2036, 2017. ,
Kraaler: A User-Perspective Web Crawler, Network Traffic Measurement and Analysis Conference, 2019. ,
, , pp.153-160
Cookie Synchronization: Everything You Always Wanted to Know But Were Afraid to Ask, pp.1-11, 2018. ,
MyAd-Choices: Bringing Transparency and Control to Online Advertising, ACM Transactions on the Web (TWEB), vol.11, pp.1-7, 2017. ,
Evaluating the Long-term Effects of Parameters on the Characteristics of the Tranco Top Sites Ranking, 12th USENIX Workshop on Cyber Security Experimentation and Test (CSET 19). USENIX Association, 2019. ,
Is scraping and crawling to collect data illegal?, 2018. ,
Web View: Measuring Monitoring Representative Information on Websites, 2019 22nd Conference on Innovation in Clouds, Internet and Networks and Workshops (ICIN), vol.4, pp.133-138, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02072471
On the Ubiquity of Web Tracking: Insights from a Billion-Page Web Crawl, The Journal of Web Science, vol.4, pp.53-66, 2018. ,
, , 2019.
The Majestic Million: The million domains we find with the most referring subnets, p.29, 2019. ,
NoMoAds: Effective and Efficient Cross-App Mobile Ad-Blocking, Proceedings on Privacy Enhancing, vol.4, pp.125-140, 2018. ,
Jellyfish: A conceptual model for the AS internet topology, Journal of Communications and Networks -JCN, vol.8, pp.1667-1671, 2005. ,
International World Wide Web Conferences Steering Committee, Republic and Canton of, Proceedings of the 26th International Conference on World Wide Web (WWW '17), pp.877-886, 2017. ,
Scraping 1 million keywords on the Google Search Engine, 2019. ,
Mobile Friendly or Attacker Friendly?: A Large-scale Security Evaluation of Mobile-first Websites, Proceedings of the 2019 ACM Asia Conference on Computer and Communications Security (Asia CCS '19), pp.206-213, 2019. ,
, MDN web docs contributors. 2019. webNavigation, p.29, 2019.
, Studies using OpenWPM, 2019.
An Early Warning System for Unrecognized Drug Side Effects Discovery, Proceedings of the 21st International Conference on World Wide Web (WWW '12 Companion), pp.437-440, 2012. ,
International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Proceedings of the 25th International Conference on World Wide Web (WWW '16), pp.121-132, 2016. ,
A Privacy Analysis of Cross-device Tracking, 26th USENIX Security Symposium (USENIX Security 17). USENIX Association, pp.1391-1408, 2017. ,
Understanding the Unplanned Internet -How Ad Tech is Broken By Design 101, 2019. ,