inria-00309408, version 2
Schema-Guided Induction of Monadic Queries
Jérôme Champavère
1, 2Rémi Gilleron
1, 2Aurélien Lemay
1, 2Joachim Niehren
2
9th International Colloquium on Grammatical Inference 5278 (2008) 15-28
Abstract: The induction of monadic node selecting queries from partially annotated XML-trees is a key task in Web information extraction. We show how to integrate schema guidance into an RPNI-based learning algorithm, in which monadic queries are represented by pruning node selecting tree transducers. We present experimental results on schema guidance by the DTD of HTML.
- 1: GRAPPA (LIFL)
- CNRS : UMR8022 – Université Charles de Gaulle - Lille III – Université des Sciences et Technologies de Lille - Lille I
- 2: MOSTRARE (INRIA Lille - Nord Europe)
- INRIA – CNRS : UMR8022 – Université des Sciences et Technologies de Lille - Lille I : EA3588 – Université Charles de Gaulle - Lille III
- Domain : Computer Science/Learning
- Available versions : v1 (2008-08-07) v2 (2009-06-26)
- inria-00309408, version 2
- http://hal.inria.fr/inria-00309408
- oai:hal.inria.fr:inria-00309408
- From: Joachim Niehren
- Submitted on: Friday, 26 June 2009 16:50:18
- Updated on: Friday, 26 June 2009 16:55:16






Associated documents
Export