Bounding Information Leakage in Machine Learning

Ganesh Del Grosso; George Pichler; Catuscia Palamidessi; Pablo Piantanida

doi:10.1016/j.neucom.2023.02.058

Journal article, Neurocomputing, 2023


Ganesh Del Grosso

Function : Author
PersonId : 1211041
IdHAL : ganesh-del-grosso
ORCID : 0000-0002-7302-1078

Concurrency, Mobility and Transactions

George Pichler

Function : Author

Vienna University of Technology = Technische Universität Wien

Catuscia Palamidessi

Function : Author
PersonId : 1106247
ORCID : 0000-0003-4597-7002

Concurrency, Mobility and Transactions

Pablo Piantanida

Function : Author
PersonId : 736967
IdHAL : pablo-piantanida
ORCID : 0000-0002-8717-2117

International Laboratory on Learning Systems

Abstract

Recently, it has been shown that Machine Learning models can leak sensitive information about their training data. This information leakage is exposed through membership and attribute inference attacks. Although many attack strategies have been proposed, little effort has been made to formalize these problems. We present a novel formalism, generalizing membership and attribute inference attack setups previously studied in the literature and connecting them to memorization and generalization. First, we derive a universal bound on the success rate of inference attacks and connect it to the generalization gap of the target model. Second, we study the question of how much sensitive information is stored by the algorithm about its training set and we derive bounds on the mutual information between the sensitive attributes and model parameters. Experimentally, we illustrate the potential of our approach by applying it to both synthetic data and classification tasks on natural images. Finally, we apply our formalism to different attribute inference strategies, with which an adversary is able to recover the identity of writers in the PenDigits dataset.
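The link the abstract draws between attack success and the generalization gap can be illustrated with a toy experiment. The sketch below is not the paper's formalism, bound, or experimental setup; it is a hypothetical loss-threshold membership attack on synthetic linear regression, using only NumPy. The function name `membership_attack_demo`, the data model, and the median threshold (which assumes an attacker who sees the pooled losses) are all assumptions made for illustration.

```python
import numpy as np


def membership_attack_demo(n=50, d=200, noise=0.5, seed=0):
    """Toy loss-threshold membership attack (illustrative assumption only).

    Returns (generalization_gap, attack_accuracy) for a least-squares model
    fitted on n samples with d features.
    """
    rng = np.random.default_rng(seed)
    w_true = rng.normal(size=d) / np.sqrt(d)

    def sample(m):
        # Synthetic regression data: y = <x, w_true> + Gaussian noise.
        X = rng.normal(size=(m, d))
        y = X @ w_true + noise * rng.normal(size=m)
        return X, y

    X_train, y_train = sample(n)  # members
    X_test, y_test = sample(n)    # non-members, same distribution

    # Least-squares fit. When d > n, lstsq returns the minimum-norm
    # solution, which interpolates (memorizes) the training set.
    w_hat, *_ = np.linalg.lstsq(X_train, y_train, rcond=None)

    loss_train = (X_train @ w_hat - y_train) ** 2
    loss_test = (X_test @ w_hat - y_test) ** 2

    # Generalization gap: mean test loss minus mean training loss.
    gap = loss_test.mean() - loss_train.mean()

    # Attack: guess "member" when the per-sample loss is below a threshold.
    # Here the threshold is the median of the pooled losses (an assumption).
    pooled = np.concatenate([loss_train, loss_test])
    is_member = np.concatenate([np.ones(n, dtype=bool), np.zeros(n, dtype=bool)])
    guess_member = pooled < np.median(pooled)
    accuracy = (guess_member == is_member).mean()
    return gap, accuracy


if __name__ == "__main__":
    for n, d in [(50, 200), (2000, 5)]:
        gap, acc = membership_attack_demo(n=n, d=d)
        print(f"n={n}, d={d}: gap={gap:.3f}, attack accuracy={acc:.3f}")
```

Under these assumptions, the interpolating model (more features than training samples) should show a large gap and a near-perfect attack, while the model fitted with far more samples than features should leave the attack close to chance.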

Keywords

Membership Inference Privacy Attacks in Machine Learning

Domains

Computer Science [cs]

Main file

2105.03875.pdf (608.62 KB)

Origin : Files produced by the author(s)

Contributor : Catuscia Palamidessi

https://inria.hal.science/hal-04349219

Submitted on : Sunday, December 17, 2023, 5:45:37 PM

Last modification on : Friday, May 17, 2024, 3:08:03 PM

Dates and versions

hal-04349219 , version 1 (17-12-2023)

Licence

Attribution

Identifiers

HAL Id : hal-04349219 , version 1
ARXIV : 2105.03875
DOI : 10.1016/j.neucom.2023.02.058

Cite

Ganesh Del Grosso, George Pichler, Catuscia Palamidessi, Pablo Piantanida. Bounding Information Leakage in Machine Learning. Neurocomputing, 2023, 534, pp.1-17. ⟨10.1016/j.neucom.2023.02.058⟩. ⟨hal-04349219⟩

Collections

X CNRS INRIA LIX X-LIX X-DEP-INFO CENTRALESUPELEC INRIA2 UNIV-PARIS-SACLAY IP_PARIS ANR GS-COMPUTER-SCIENCE HUB-IA ILLS
