Active Learning Enhanced Document Annotation for Sentiment Analysis - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2013

Active Learning Enhanced Document Annotation for Sentiment Analysis

Résumé

Sentiment analysis is a popular research area devoted to methods allowing automatic analysis of the subjectivity in textual content. Many of these methods are based on the using of machine learning and they usually depend on manually annotated training corpora. However, the creation of corpora is a time-consuming task, which leads to necessity of methods facilitating this process. Methods of active learning, aimed at the selection of the most informative examples according to the given classification task, can be utilized in order to increase the effectiveness of the annotation. Currently it is a lack of systematical research devoted to the application of active learning in the creation of corpora for sentiment analysis. Hence, the aim of this work is to survey some of the active learning strategies applicable in annotation tools used in the context of sentiment analysis. We evaluated compared strategies on the domain of product reviews. The results of experiments confirmed the increase of the corpus quality in terms of higher classification accuracy achieved on the test set for most of the evaluated strategies (more than 20% higher accuracy in comparison to the random strategy).
Fichier principal
Vignette du fichier
978-3-642-40511-2_24_Chapter.pdf (387.62 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01506776 , version 1 (12-04-2017)

Licence

Paternité

Identifiants

  • HAL Id : hal-01506776 , version 1

Citer

Peter Koncz, Ján Paralič. Active Learning Enhanced Document Annotation for Sentiment Analysis. 1st Cross-Domain Conference and Workshop on Availability, Reliability, and Security in Information Systems (CD-ARES), Sep 2013, Regensburg, Germany. pp.345-353. ⟨hal-01506776⟩
174 Consultations
340 Téléchargements

Partager

Gmail Facebook X LinkedIn More