Aspects of Semi-Supervised and Active Learning in Conditional Random Fields
Conference paper. Year: 2011

Abstract

Conditional random fields are among the state-of-the-art approaches to structured output prediction, and the model has been adopted for various real-world problems. Supervised classification is costly, since labeled data are usually expensive to produce. Unlabeled data are relatively cheap, but how should they be used? Unlabeled data can be used to estimate the marginal probability of observations, and we exploit this idea in our work. Introducing unlabeled data and the probability of observations into a purely discriminative model is a challenging task. We consider an extrapolation of a recently proposed semi-supervised criterion to the model of conditional random fields and show its drawbacks. We discuss alternative uses of the marginal probability and propose a pool-based active learning approach based on quota sampling. We carry out experiments on synthetic as well as standard natural language data sets, and we show that the proposed quota sampling active learning method is efficient.
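
The abstract does not spell out the selection procedure, so the following is only a minimal sketch of one possible reading of pool-based quota sampling. It assumes discrete, single-token observations (not the sequences a conditional random field works with) and an empirical-frequency estimate of the marginal probability computed from the unlabeled pool. The function names `estimate_marginal` and `quota_sample` are hypothetical and are not taken from the paper.

```python
from collections import Counter
import random


def estimate_marginal(pool):
    """Empirical marginal probability of each observation value in the pool."""
    counts = Counter(pool)
    total = len(pool)
    return {x: c / total for x, c in counts.items()}


def quota_sample(pool, budget, seed=0):
    """Return sorted pool indices to send for labeling.

    Assumption (not from the paper): each observation value receives a share
    of the labeling budget proportional to its empirical frequency in the
    unlabeled pool, using largest-remainder rounding.
    """
    if budget > len(pool):
        raise ValueError("budget cannot exceed the pool size")
    rng = random.Random(seed)
    total = len(pool)

    # Group pool indices by observation value.
    by_value = {}
    for i, x in enumerate(pool):
        by_value.setdefault(x, []).append(i)

    # Integer arithmetic avoids floating-point rounding in the quotas.
    quota = {x: budget * len(idx) // total for x, idx in by_value.items()}
    remainder = {x: budget * len(idx) % total for x, idx in by_value.items()}

    # Hand out the leftover slots to the values with the largest remainders.
    leftover = budget - sum(quota.values())
    ranked = sorted(remainder, key=remainder.get, reverse=True)
    for x in ranked[:leftover]:
        quota[x] += 1

    # Draw each value's quota uniformly at random from its group.
    chosen = []
    for x, k in quota.items():
        chosen.extend(rng.sample(by_value[x], k))
    return sorted(chosen)


if __name__ == "__main__":
    pool = ["a"] * 12 + ["b"] * 6 + ["c"] * 2
    picked = quota_sample(pool, budget=10)
    print(Counter(pool[i] for i in picked))
```

In this sketch the labeled sample mirrors the observation frequencies of the pool; whether the paper allocates its quotas in the same way is not stated in the abstract.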
Main file: sokolovska_243.pdf (228.53 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00624831, version 1 (20-09-2011)

Identifiers

  • HAL Id: hal-00624831, version 1

Cite

Nataliya Sokolovska. Aspects of Semi-Supervised and Active Learning in Conditional Random Fields. ECML PKDD 2011, Sep 2011, Greece. pp.273-288. ⟨hal-00624831⟩