Differential Privacy for Bayesian Inference through Posterior Sampling

Christos Dimitrakakis; Blaine Nelson; Zuhe Zhang; Aikaterini Mitrokotsa; Benjamin I P Rubinstein

Journal article. Journal of Machine Learning Research. Year: 2017


Christos Dimitrakakis

Role: Author
PersonId : 6538
IdHAL : christos-dimitrakakis
ORCID : 0000-0002-5367-5189

Harvard University

Université de Lille, Sciences Humaines et Sociales

Sequential Learning

Chalmers University of Technology [Göteborg]

Blaine Nelson

Role: Author
PersonId : 1005341

Google Inc [Mountain View]

Zuhe Zhang

Role: Author
PersonId : 973254

University of Melbourne

Aikaterini Mitrokotsa

Role: Author

Chalmers University of Technology [Göteborg]

Benjamin I P Rubinstein

Role: Author
PersonId : 1005342

University of Melbourne

Abstract

Differential privacy formalises privacy-preserving mechanisms that provide access to a database. Can Bayesian inference be used directly to provide private access to data? The answer is yes: under certain conditions on the prior, sampling from the posterior distribution can lead to a desired level of privacy and utility. For a uniform treatment, we define differential privacy over arbitrary data set metrics, outcome spaces and distribution families. This allows us to also deal with non-i.i.d. or non-tabular data sets. We then prove bounds on the sensitivity of the posterior to the data, which delivers a measure of robustness. We also show how to use posterior sampling to provide differentially private responses to queries, within a decision-theoretic framework. Finally, we provide bounds on the utility of answers to queries and on the ability of an adversary to distinguish between data sets. The latter are complemented by a novel use of Le Cam's method to obtain lower bounds on distinguishability. Our results hold for arbitrary metrics, including those for the common definition of differential privacy. For specific choices of the metric, we give a number of examples satisfying our assumptions.

* A preliminary version of this paper appeared in Algorithmic Learning Theory 2014 (Dimitrakakis et al., 2014). This version corrects proofs and constant factors in the upper bounds, and introduces new material on utility analysis, lower bounds and examples.
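As a rough illustration of the idea of answering a query with a posterior sample, the sketch below uses a Beta-Bernoulli model. It is not taken from the paper: the model, the Beta(1, 1) prior, the function names and the interval [0.05, 0.95] are all assumptions made for this example. It draws one sample from the posterior and then numerically checks how far the log posterior densities of two data sets differing in a single record can drift apart on that interval. On the open interval (0, 1) this gap is unbounded for this model, which is one way to see why conditions on the prior or the parameter space matter.

```python
import math
import random


def posterior_params(data, alpha=1.0, beta=1.0):
    """Beta posterior parameters for 0/1 data under an assumed Beta(alpha, beta) prior."""
    ones = sum(data)
    return alpha + ones, beta + len(data) - ones


def sample_posterior(data, rng, alpha=1.0, beta=1.0):
    """Release a single draw from the posterior as the response."""
    a, b = posterior_params(data, alpha, beta)
    return rng.betavariate(a, b)


def log_beta_pdf(theta, a, b):
    """Log density of Beta(a, b) at theta, for theta strictly inside (0, 1)."""
    log_norm = math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
    return log_norm + (a - 1) * math.log(theta) + (b - 1) * math.log(1 - theta)


def max_log_ratio(data_x, data_y, lo=0.05, hi=0.95, steps=181):
    """Largest absolute log-density gap between two posteriors on a grid over [lo, hi].

    The restriction to [lo, hi] is an assumption of this sketch, not a result.
    """
    ax, bx = posterior_params(data_x)
    ay, by = posterior_params(data_y)
    worst = 0.0
    for i in range(steps):
        theta = lo + (hi - lo) * i / (steps - 1)
        gap = abs(log_beta_pdf(theta, ax, bx) - log_beta_pdf(theta, ay, by))
        worst = max(worst, gap)
    return worst


rng = random.Random(0)
x = [1, 0, 1, 1, 0, 1, 0, 1]   # hypothetical data set
y = x[:-1] + [0]               # neighbour: last record changed
theta = sample_posterior(x, rng)
eps_hat = max_log_ratio(x, y)
print(f"posterior sample: {theta:.3f}, max log-density gap: {eps_hat:.3f}")
```

The printed gap is only an empirical quantity for this one pair of data sets and should not be read as a privacy guarantee; widening the interval towards 0 or 1 makes it grow without bound.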

Keywords

Bayesian inference; differential privacy; robustness; adversarial learning

Domains

Machine Learning [stat.ML]; Cryptography and Security [cs.CR]; Statistics [math.ST]

Main file

15-257.pdf (498.77 KB)

Origin: Publisher files allowed on an open archive


https://inria.hal.science/hal-01500302

Submitted on: Monday, 3 April 2017, 07:35:50

Last modified on: Wednesday, 24 April 2024, 16:52:49

Long-term archiving on: Tuesday, 4 July 2017, 12:20:17

Dates and versions

hal-01500302 , version 1 (03-04-2017)

Identifiers

HAL Id : hal-01500302 , version 1

Cite

Christos Dimitrakakis, Blaine Nelson, Zuhe Zhang, Aikaterini Mitrokotsa, Benjamin I P Rubinstein. Differential Privacy for Bayesian Inference through Posterior Sampling. Journal of Machine Learning Research, 2017, 18 (11), pp. 1-39. ⟨hal-01500302⟩


Collections

UNIV-LILLE3 CNRS INRIA CRISTAL INRIA2 CRISTAL-SEQUEL UNIV-LILLE

