Toward training NLP models to take into account privacy leakages

Gaspard Berthelier; Antoine Boutet; Antoine Richard

Communication Dans Un Congrès Année : 2023

Toward training NLP models to take into account privacy leakages

(1) , (1) , (2)

1
2

Gaspard Berthelier

Fonction : Auteur

Privacy Models, Architectures and Tools for the Information Society

Antoine Boutet

Fonction : Auteur
PersonId : 6722
IdHAL : antoine-boutet
IdRef : 170309207

Privacy Models, Architectures and Tools for the Information Society

Antoine Richard

Fonction : Auteur
PersonId : 1182658
IdHAL : a-t-richard

Hospices Civils de Lyon

Résumé

With the rise of machine learning and data-driven models especially in the field of Natural Language Processing (NLP), a strong demand for sharing data between organisations has emerged. However datasets are usually composed of personal data and thus subject to numerous regulations which require anonymization before disseminating the data. In the medical domain for instance, patient records are extremely sensitive and private, but the de-identification of medical documents is a complex task. Recent advances in NLP models have shown encouraging results in this field, but the question of whether deploying such models is safe remains. In this paper, we evaluate three privacy risks on NLP models trained on sensitive data. Specifically, we evaluate counterfactual memorization, which corresponds to rare and sensitive information which has too much influence on the model. We also evaluate membership inference as well as the ability to extract verbatim training data from the model. With this evaluation, we can cure data at risk from the training data and calibrate hyper parameters to provide a supplementary utility and privacy tradeoff to the usual mitigation strategies such as using differential privacy. We exhaustively illustrate the privacy leakage of NLP models through a use-case using medical texts and discuss the impact of both the proposed methodology and mitigation schemes.

Mots clés

NLP models Privacy Membership Inference Counterfactual Memorisation Data Extraction

Domaines

Informatique [cs]

Fichier principal

NLP_Privacy_Hopitaux (18).pdf (1.03 Mo)

Origine : Fichiers produits par l'(les) auteur(s)
Licence : Copyright (Tous droits réservés)

Antoine Boutet : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04299405

Soumis le : mercredi 22 novembre 2023-10:42:29

Dernière modification le : vendredi 26 janvier 2024-08:35:34

Dates et versions

hal-04299405 , version 1 (22-11-2023)

Licence

Paternité

Identifiants

HAL Id : hal-04299405 , version 1

Citer

Gaspard Berthelier, Antoine Boutet, Antoine Richard. Toward training NLP models to take into account privacy leakages. BigData 2023 - IEEE International Conference on Big Data, Dec 2023, Sorrento, Italy. pp.1-9. ⟨hal-04299405⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

HCL INRIA INSA-LYON INRIA2 CITI INSA-GROUPE UDL ANR CYBERSCURITE

88 Consultations

71 Téléchargements

Toward training NLP models to take into account privacy leakages

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Partager