inria-00088077, version 2
Learning n-ary Node Selecting Tree Transducers from Completely Annotated Examples
Aurélien Lemay a, 1Joachim Niehren
1Rémi Gilleron a, 1
8th International Colloquium on Grammatical Inference 4201 (2006) 253-267
Abstract: We present the first algorithm for learning n-ary node selection queries in trees from completely annotated examples by methods of grammatical inference. We propose to represent n-ary queries by deterministic n-ary node selecting tree transducers (NSTTs), that are known to capture the class of MSO-definable n-ary queries. Despite of this highly expressive, we show that n-aryy queries, selecting a polynomially bounded number of tuples per tree, represented by deterministic NSTTs can be learned from polynomial time and data while allowing for efficient enumeration of query answers. An application to wrapper induction in Web information extraction yields encouraging results.
- a – Université Charles de Gaulle - Lille III
- 1: MOSTRARE (INRIA Futurs)
- INRIA – CNRS : UMR8022 – Université des Sciences et Technologies de Lille - Lille I : EA3588 – Université Charles de Gaulle - Lille III
- Domain : Computer Science/Learning
- Available versions : v1 (2006-07-31) v2 (2006-08-14)
- inria-00088077, version 2
- http://hal.inria.fr/inria-00088077
- oai:hal.inria.fr:inria-00088077
- From: Joachim Niehren
- Submitted on: Sunday, 13 August 2006 21:42:21
- Updated on: Monday, 14 December 2009 22:06:48






Associated documents
Export