Utility-Driven Anonymization in Data Publishing

Abstract : Privacy-preserving data publication has been studied intensely in the past years. To date, all existing approaches transform data values by random perturbation or generalization. In this paper, we introduce a radically different data anonymization methodology. Our proposal aims to maintain a certain amount of {\em patterns}, defined in terms of a set of properties of interest that hold for the original data. Such properties are represented as linear relationships among data points. We present an algorithm that generates a set of anonymized data that strictly preserves these properties, thus maintaining specified {\em patterns} in the data. Extensive experiments with real and synthetic data show that our algorithm is efficient, and produces anonymized data that affords high utility in several data analysis tasks while safeguarding privacy.
Document type :
Conference papers
Liste complète des métadonnées

https://hal.inria.fr/inria-00623578
Contributor : Chedy Raïssi <>
Submitted on : Wednesday, September 14, 2011 - 4:04:54 PM
Last modification on : Thursday, January 11, 2018 - 6:19:54 AM

Identifiers

  • HAL Id : inria-00623578, version 1

Collections

Citation

Mingqiang Xue, Panagiotis Karras, Chedy Raïssi, Hung Keng Pung. Utility-Driven Anonymization in Data Publishing. 20th ACM Conference on Information and Knowledge Management - CIKM 2011, Oct 2011, Glasgow, United Kingdom. ACM, 2011. 〈inria-00623578〉

Share

Metrics

Record views

193