inria-00309408, version 2
Schema-Guided Induction of Monadic Queries
Jérôme Champavère
1, 2Rémi Gilleron
1, 2Aurélien Lemay
1, 2Joachim Niehren
2
9th International Colloquium on Grammatical Inference 5278 (2008) 15-28
Résumé : The induction of monadic node selecting queries from partially annotated XML-trees is a key task in Web information extraction. We show how to integrate schema guidance into an RPNI-based learning algorithm, in which monadic queries are represented by pruning node selecting tree transducers. We present experimental results on schema guidance by the DTD of HTML.
- 1 : GRAPPA (LIFL)
- CNRS : UMR8022 – Université Charles de Gaulle - Lille III – Université Lille 1 - Sciences et Technologies
- 2 : MOSTRARE (INRIA Lille - Nord Europe)
- INRIA – CNRS : UMR8022 – Université Lille 1 - Sciences et Technologies : EA3588 – Université Charles de Gaulle - Lille III
- Domaine : Informatique/Apprentissage
- Versions disponibles : v1 (07-08-2008) v2 (26-06-2009)
- inria-00309408, version 2
- http://hal.inria.fr/inria-00309408
- oai:hal.inria.fr:inria-00309408
- Contributeur : Joachim Niehren
- Soumis le : Vendredi 26 Juin 2009, 16:50:18
- Dernière modification le : Vendredi 26 Juin 2009, 16:55:16






Documents associés
Exporter