Semantic Linking for Event-Based Classification of Tweets

Abstract: Detecting which tweets are related to events and classifying them into categories is a challenging task because of the peculiarities of Twitter language and the lack of contextual information. We propose to address this challenge by taking advantage of information that can be automatically acquired from external knowledge bases. In particular, we enrich and generalise the textual content of tweets by linking Named Entities (NE) to concepts in both the DBpedia and YAGO ontologies, and we exploit their specific or generic types to replace NE mentions in tweets. The approach is applied to build a supervised classifier that separates event-related from non-event-related tweets and assigns event-related tweets to the event categories defined by the Topic Detection and Tracking (TDT) community. We compare Naive Bayes (NB), Support Vector Machines (SVM) and Long Short-Term Memory (LSTM) classification algorithms, showing that NE linking and replacement improve classification performance and contribute to reducing overfitting, especially with Recurrent Neural Networks (RNN).
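The replacement step described in the abstract can be illustrated with a minimal sketch. It assumes that entity linking has already been done and stands in for it with a small hand-written lookup table; the table `ENTITY_TYPES`, the function `replace_entities`, and the type labels are hypothetical and are not taken from the paper or from the actual DBpedia or YAGO ontologies.

```python
import re

# Hypothetical mention-to-type table standing in for an entity linker.
# The labels are illustrative only, not real DBpedia/YAGO output.
ENTITY_TYPES = {
    "Barack Obama": {"specific": "President", "generic": "Person"},
    "Paris": {"specific": "City", "generic": "Place"},
}

# One alternation over all known mentions, longest first so that
# multi-word names are preferred over shorter overlapping ones.
_MENTIONS = re.compile(
    r"\b(?:"
    + "|".join(re.escape(m) for m in sorted(ENTITY_TYPES, key=len, reverse=True))
    + r")\b"
)


def replace_entities(tweet: str, level: str = "generic") -> str:
    """Replace each known entity mention with its type at the given level."""
    return _MENTIONS.sub(lambda m: ENTITY_TYPES[m.group(0)][level], tweet)


example = "Barack Obama lands in Paris tonight"
print(replace_entities(example, "generic"))
print(replace_entities(example, "specific"))
```

Under these assumptions, the generalised text rather than the raw tweet would be passed to the classifier, so that tweets mentioning different entities of the same type share features.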
Document type:
Journal article
International Journal of Computational Linguistics and Applications, Alexander Gelbukh, 2017, pp.12


https://hal.inria.fr/hal-01529729
Contributor: Amosse Edouard
Submitted on: Wednesday, 31 May 2017 - 12:14:50
Last modified on: Thursday, 15 June 2017 - 09:09:35

File

paper 228.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01529729, version 1


Citation

Edouard Amosse, Elena Cabrio, Sara Tonelli, Nhan Le Thanh. Semantic Linking for Event-Based Classification of Tweets. International Journal of Computational Linguistics and Applications, Alexander Gelbukh, 2017, pp.12. <hal-01529729>


Metrics

Record views: 105

Document downloads: 69