sign in
english version rss feed

inria-00088077, version 2

Learning n-ary Node Selecting Tree Transducers from Completely Annotated Examples

Aurélien Lemay a1, Joachim Niehren () 1, Rémi Gilleron a1

8th International Colloquium on Grammatical Inference 4201 (2006) 253-267

Abstract: We present the first algorithm for learning n-ary node selection queries in trees from completely annotated examples by methods of grammatical inference. We propose to represent n-ary queries by deterministic n-ary node selecting tree transducers (NSTTs), that are known to capture the class of MSO-definable n-ary queries. Despite of this highly expressive, we show that n-aryy queries, selecting a polynomially bounded number of tuples per tree, represented by deterministic NSTTs can be learned from polynomial time and data while allowing for efficient enumeration of query answers. An application to wrapper induction in Web information extraction yields encouraging results.

  • a –  Université Charles de Gaulle - Lille III
  • 1:  MOSTRARE (INRIA Futurs)
  • INRIA – CNRS : UMR8022 – Université des Sciences et Technologies de Lille - Lille I : EA3588 – Université Charles de Gaulle - Lille III
 
  • inria-00088077, version 2
  • oai:hal.inria.fr:inria-00088077
  • From: 
  • Submitted on: Sunday, 13 August 2006 21:42:21
  • Updated on: Monday, 14 December 2009 22:06:48
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...