inria-00088077, version 2
Learning n-ary Node Selecting Tree Transducers from Completely Annotated Examples
Aurélien Lemay a, 1Joachim Niehren
1Rémi Gilleron a, 1
8th International Colloquium on Grammatical Inference 4201 (2006) 253-267
Résumé : We present the first algorithm for learning n-ary node selection queries in trees from completely annotated examples by methods of grammatical inference. We propose to represent n-ary queries by deterministic n-ary node selecting tree transducers (NSTTs), that are known to capture the class of MSO-definable n-ary queries. Despite of this highly expressive, we show that n-aryy queries, selecting a polynomially bounded number of tuples per tree, represented by deterministic NSTTs can be learned from polynomial time and data while allowing for efficient enumeration of query answers. An application to wrapper induction in Web information extraction yields encouraging results.
- a – Université Charles de Gaulle - Lille III
- 1 : MOSTRARE (INRIA Futurs)
- INRIA – CNRS : UMR8022 – Université Lille 1 - Sciences et Technologies : EA3588 – Université Charles de Gaulle - Lille III
- Domaine : Informatique/Apprentissage
- Versions disponibles : v1 (31-07-2006) v2 (14-08-2006)
- inria-00088077, version 2
- http://hal.inria.fr/inria-00088077
- oai:hal.inria.fr:inria-00088077
- Contributeur : Joachim Niehren
- Soumis le : Dimanche 13 Août 2006, 21:42:21
- Dernière modification le : Lundi 14 Décembre 2009, 22:06:48






Documents associés
Exporter