ρ-uncertainty: Inference-Proof Transaction Anonymization

Jianneng Cao; Panagiotis Karras; Chedy Raïssi; Kian-Lee Tan

doi:10.14778/1920841.1920971

Article Dans Une Revue Proceedings of the VLDB Endowment (PVLDB) Année : 2010

ρ-uncertainty: Inference-Proof Transaction Anonymization

(1) , (1) , (2) , (1)

1
2

Jianneng Cao

Fonction : Auteur correspondant
PersonId : 906321

Connectez-vous pour contacter l'auteur

Department of Computer Science - Singapore

Panagiotis Karras

Fonction : Auteur
PersonId : 906322

Department of Computer Science - Singapore

Chedy Raïssi

Fonction : Auteur
PersonId : 16730
IdHAL : chedy-raissi
IdRef : 125691750

Knowledge representation, reasonning

Kian-Lee Tan

Fonction : Auteur
PersonId : 906323

Department of Computer Science - Singapore

Résumé

The publication of transaction data, such as market basket data, medical records, and query logs, serves the public benefit. Mining such data allows for the derivation of association rules that connect certain items to others with measurable confidence. Still, this type of data analysis poses a privacy threat; an adversary having partial information on a person's behavior may confidently associate that person to an item deemed to be sensitive. Ideally, an anonymization of such data should lead to an inference-proof version that prevents the association of individuals to sensitive items, while otherwise allowing for truthful associations to be derived. Original approaches to this problem were based on value perturbation, damaging data integrity. Recently, value generalization has been proposed as an alternative; still, approaches based on it have assumed either that all items are equally sensitive, or that some are sensitive and can be known to an adversary only by association, while others are non-sensitive and can be known directly. Yet in reality there is a distinction between sensitive and non-sensitive items, but an adversary may possess information on any of them. Most critically, no antecedent method aims at a clear inference-proof privacy guarantee. In this paper, we propose 휌-uncertainty, the first, to our knowledge, privacy concept that inherently safeguards against sensitive associations without constraining the nature of an adversary's knowledge and without falsifying data. The problem of achieving 휌-uncertainty with low information loss is challenging because it is natural. A trivial solution is to suppress all sensitive items. We develop more sophisticated schemes. In a broad experimental study, we show that the problem is solved non-trivially by a technique that combines generalization and suppression, which also achieves favorable results compared to a baseline perturbation-based scheme.

Mots clés

privacy association rules inference-proof

Domaines

Base de données [cs.DB]

Fichier principal

R92.pdf (1005.39 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Chedy Raïssi : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00610934

Soumis le : lundi 25 juillet 2011-12:15:47

Dernière modification le : vendredi 24 mars 2023-14:52:54

Archivage à long terme le : mercredi 26 octobre 2011-02:21:33

Dates et versions

inria-00610934 , version 1 (25-07-2011)

Identifiants

HAL Id : inria-00610934 , version 1
DOI : 10.14778/1920841.1920971

Citer

Jianneng Cao, Panagiotis Karras, Chedy Raïssi, Kian-Lee Tan. ρ-uncertainty: Inference-Proof Transaction Anonymization. Proceedings of the VLDB Endowment (PVLDB), 2010, 3 (1), pp.1033-1044. ⟨10.14778/1920841.1920971⟩. ⟨inria-00610934⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA

406 Consultations

521 Téléchargements

ρ-uncertainty: Inference-Proof Transaction Anonymization

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager