Pre-processing Framework for Twitter Sentiment Classification

Elias Dritsas; Gerasimos Vonitsanos; Ioannis E. Livieris; Andreas Kanavos; Aristidis Ilias; Christos Makris; Athanasios Tsakalidis

doi:10.1007/978-3-030-19909-8_12

Communication Dans Un Congrès Année : 2019

Pre-processing Framework for Twitter Sentiment Classification

(1) , (1) , (2) , (1, 2) , (1, 3) , (1, 3) , (1, 3)

1
2
3

Elias Dritsas

Fonction : Auteur
PersonId : 1058279

University of Patras

Gerasimos Vonitsanos

Fonction : Auteur
PersonId : 1058252

University of Patras

Ioannis E. Livieris

Fonction : Auteur
PersonId : 1058251

Technological Educational Institute of Western Greece [Patra]

Andreas Kanavos

Fonction : Auteur
PersonId : 992416

University of Patras

Technological Educational Institute of Western Greece [Patra]

Aristidis Ilias

Fonction : Auteur
PersonId : 991098

University of Patras

Department of Computer Engineering and Informatics [Patras]

Christos Makris

Fonction : Auteur
PersonId : 992346

University of Patras

Department of Computer Engineering and Informatics [Patras]

Athanasios Tsakalidis

Fonction : Auteur
PersonId : 992342

University of Patras

Department of Computer Engineering and Informatics [Patras]

Résumé

Twitter Sentiment Classification is undergoing great appeal from the research community; also, user posts and opinions are producing very interesting conclusions and information. In the context of this paper, a pre-processing tool was developed in Python language. This tool processes text and natural language data intending to remove wrong values and noise. The main reason for developing such a tool is to achieve sentiment analysis in an optimum and efficient way. The most remarkable characteristic is considered the use of emojis and emoticons in the sentiment analysis field. Moreover, supervised machine learning techniques were utilized for the analysis of users’ posts. Through our experiments, the performance of the involved classifiers, namely Naive Bayes and SVM, under specific parameters such as the size of the training data, the employed methods for feature selection (unigrams, bigrams and trigrams) are evaluated. Finally, the performance was assessed based on independent datasets through the application of k-fold cross validation.

Mots clés

Classification Microblogging Pre-processing Sentiment analysis Supervised machine learning Twitter

Domaines

Informatique [cs]

Fichier principal

484534_1_En_12_Chapter.pdf (280.24 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Hal Ifip : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-02363858

Soumis le : jeudi 14 novembre 2019-15:51:18

Dernière modification le : mardi 14 février 2023-15:30:05

Archivage à long terme le : samedi 15 février 2020-16:24:18

Dates et versions

hal-02363858 , version 1 (14-11-2019)

Licence

Paternité

Identifiants

HAL Id : hal-02363858 , version 1
DOI : 10.1007/978-3-030-19909-8_12

Citer

Elias Dritsas, Gerasimos Vonitsanos, Ioannis E. Livieris, Andreas Kanavos, Aristidis Ilias, et al.. Pre-processing Framework for Twitter Sentiment Classification. 15th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), May 2019, Hersonissos, Greece. pp.138-149, ⟨10.1007/978-3-030-19909-8_12⟩. ⟨hal-02363858⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

IFIP IFIP-AICT IFIP-TC IFIP-WG IFIP-TC12 IFIP-AIAI IFIP-WG12-5 IFIP-AICT-560

89 Consultations

52 Téléchargements

Pre-processing Framework for Twitter Sentiment Classification

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager