Schema-Guided Induction of Monadic Queries
Abstract
The induction of monadic node-selecting queries from partially annotated XML trees is a key task in Web information extraction. We show how to integrate schema guidance into an RPNI-based learning algorithm, in which monadic queries are represented by pruning node-selecting tree transducers. We present experimental results on schema guidance by the DTD of HTML.
Domains
Machine Learning [cs.LG]
Origin: Files produced by the author(s)