Skip to Main content Skip to Navigation
New interface
Conference papers

Establishing a Strong Baseline for Privacy Policy Classification

Abstract : Digital service users are routinely exposed to Privacy Policy consent forms, through which they enter contractual agreements consenting to the specifics of how their personal data is managed and used. Nevertheless, despite renewed importance following legislation such as the European GDPR, a majority of people still ignore policies due to their length and complexity. To counteract this potentially dangerous reality, in this paper we present three different models that are able to assign pre-defined categories to privacy policy paragraphs, using supervised machine learning. In order to train our neural networks, we exploit a dataset containing 115 privacy policies defined by US companies. An evaluation shows that our approach outperforms state-of-the-art by 5% over comparable and previously-reported F1 values. In addition, our method is completely reproducible since we provide open access to all resources. Given these two contributions, our approach can be considered as a strong baseline for privacy policy classification.
Document type :
Conference papers
Complete list of metadata

https://hal.inria.fr/hal-03440825
Contributor : Hal Ifip Connect in order to contact the contributor
Submitted on : Monday, November 22, 2021 - 3:32:23 PM
Last modification on : Monday, November 22, 2021 - 4:37:51 PM
Long-term archiving on: : Wednesday, February 23, 2022 - 7:57:14 PM

File

 Restricted access
To satisfy the distribution rights of the publisher, the document is embargoed until : 2023-01-01

Please log in to resquest access to the document

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Collections

Citation

Najmeh Mousavi Nejad, Pablo Jabat, Rostislav Nedelchev, Simon Scerri, Damien Graux. Establishing a Strong Baseline for Privacy Policy Classification. 35th IFIP International Conference on ICT Systems Security and Privacy Protection (SEC), Sep 2020, Maribor, Slovenia. pp.370-383, ⟨10.1007/978-3-030-58201-2_25⟩. ⟨hal-03440825⟩

Share

Metrics

Record views

65