Active Learning Enhanced Document Annotation for Sentiment Analysis - Archive ouverte HAL Access content directly
Conference Papers Year : 2013

Active Learning Enhanced Document Annotation for Sentiment Analysis

(1) , (1)


Sentiment analysis is a popular research area devoted to methods allowing automatic analysis of the subjectivity in textual content. Many of these methods are based on the using of machine learning and they usually depend on manually annotated training corpora. However, the creation of corpora is a time-consuming task, which leads to necessity of methods facilitating this process. Methods of active learning, aimed at the selection of the most informative examples according to the given classification task, can be utilized in order to increase the effectiveness of the annotation. Currently it is a lack of systematical research devoted to the application of active learning in the creation of corpora for sentiment analysis. Hence, the aim of this work is to survey some of the active learning strategies applicable in annotation tools used in the context of sentiment analysis. We evaluated compared strategies on the domain of product reviews. The results of experiments confirmed the increase of the corpus quality in terms of higher classification accuracy achieved on the test set for most of the evaluated strategies (more than 20% higher accuracy in comparison to the random strategy).
Fichier principal
Vignette du fichier
978-3-642-40511-2_24_Chapter.pdf (387.62 Ko) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-01506776 , version 1 (12-04-2017)


Attribution - CC BY 4.0


  • HAL Id : hal-01506776 , version 1


Peter Koncz, Ján Paralič. Active Learning Enhanced Document Annotation for Sentiment Analysis. 1st Cross-Domain Conference and Workshop on Availability, Reliability, and Security in Information Systems (CD-ARES), Sep 2013, Regensburg, Germany. pp.345-353. ⟨hal-01506776⟩
164 View
325 Download


Gmail Facebook Twitter LinkedIn More