Spamming in Linked Data

Abstract : The rapidly growing commercial interest in Linked Data raises the prospect of "Linked Data spam", which we define as "deliberately misleading information (data and links) published as Linked Data, with the goal of creating financial gain for the publisher". Compared to conventional technologies affected by spamming, e.g. email and blogs, spammers targeting Linked Data may not be able to push information directly towards consumers, but rather may seek to exploit a lack of human involvement in automated data integration processes performed by applications consuming Linked Data. This paper aims to lay a foundation for future work addressing the issue of Linked Data spam, by providing the following contributions: i) a formal definition of spamming in Linked Data; ii) a classification of potential spamming techniques; iii) a sample dataset demonstrating these techniques, for use in evaluating anti-spamming mechanisms; iv) preliminary recommendations for anti-spamming strategies.
Document type :
Conference papers
Complete list of metadatas

Cited literature [21 references]  Display  Hide  Download

https://hal.inria.fr/hal-00751205
Contributor : Luca Costabello <>
Submitted on : Monday, November 12, 2012 - 10:03:39 PM
Last modification on : Tuesday, February 26, 2019 - 8:22:06 AM
Long-term archiving on: Wednesday, February 13, 2013 - 3:48:03 AM

File

cold2012.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00751205, version 1

Collections

Citation

Ali Hasnain, Mustafa Al-Bakri, Luca Costabello, Zijie Cong, Ian Davis, et al.. Spamming in Linked Data. Third International Workshop on Consuming Linked Data (COLD2012), Nov 2012, Boston, United States. ⟨hal-00751205⟩

Share

Metrics

Record views

762

Files downloads

393