Skip to Main content Skip to Navigation
Conference papers

N-Gram Based Secure Similar Document Detection

Abstract : Secure similar document detection (SSDD) plays an important role in many applications, such as justifying the need-to-know basis and facilitating communication between government agencies. The SSDD problem considers situations where Alice with a query document wants to find similar information from Bob’s document collection. During this process, the content of the query document is not disclosed to Bob, and Bob’s document collection is not disclosed to Alice. Existing SSDD protocols are developed under the vector space model, which has the advantage of identifying global similar information. To effectively and securely detect similar documents with overlapping text fragments, this paper proposes a novel n-gram based SSDD protocol.
Keywords : privacy security n-gram
Document type :
Conference papers
Complete list of metadata

Cited literature [10 references]  Display  Hide  Download

https://hal.inria.fr/hal-01586584
Contributor : Hal Ifip <>
Submitted on : Wednesday, September 13, 2017 - 8:56:00 AM
Last modification on : Thursday, August 8, 2019 - 4:02:02 PM
Long-term archiving on: : Thursday, December 14, 2017 - 12:57:04 PM

File

978-3-642-22348-8_19_Chapter.p...
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Wei Jiang, Bharath Samanthula. N-Gram Based Secure Similar Document Detection. 23th Data and Applications Security (DBSec), Jul 2011, Richmond, VA, United States. pp.239-246, ⟨10.1007/978-3-642-22348-8_19⟩. ⟨hal-01586584⟩

Share

Metrics