Utility-Driven Anonymization in Data Publishing - Archive ouverte HAL Access content directly
Conference Papers Year : 2011

Utility-Driven Anonymization in Data Publishing

(1) , (2) , (3) , (1)


Privacy-preserving data publication has been studied intensely in the past years. To date, all existing approaches transform data values by random perturbation or generalization. In this paper, we introduce a radically different data anonymization methodology. Our proposal aims to maintain a certain amount of {\em patterns}, defined in terms of a set of properties of interest that hold for the original data. Such properties are represented as linear relationships among data points. We present an algorithm that generates a set of anonymized data that strictly preserves these properties, thus maintaining specified {\em patterns} in the data. Extensive experiments with real and synthetic data show that our algorithm is efficient, and produces anonymized data that affords high utility in several data analysis tasks while safeguarding privacy.
Not file

Dates and versions

inria-00623578 , version 1 (14-09-2011)


  • HAL Id : inria-00623578 , version 1


Mingqiang Xue, Panagiotis Karras, Chedy Raïssi, Hung Keng Pung. Utility-Driven Anonymization in Data Publishing. 20th ACM Conference on Information and Knowledge Management - CIKM 2011, ACM, Oct 2011, Glasgow, United Kingdom. ⟨inria-00623578⟩
109 View
0 Download


Gmail Facebook Twitter LinkedIn More