Serialising the ISO SynAF Syntactic Object Model

Abstract : This paper introduces , an XML format developed to serialise the object model defined by the ISO Syntactic Annotation Framework SynAF. Based on widespread best practices we adapt a popular XML format for syntactic annotation, TigerXML, with additional features to support a variety of syntactic phenomena including constituent and dependency structures, binding, and different node types such as compounds or empty elements. We also define interfaces to other formats and standards including the Morpho-syntactic Annotation Framework MAF and the ISOCat Data Category Registry. Finally a case study of the German Treebank TueBa-D/Z is presented, showcasing the handling of constituent structures, topological fields and coreference annotation in tandem.
Document type :
Preprints, Working Papers, ...
Complete list of metadatas

https://hal.inria.fr/inria-00612833
Contributor : Laurent Romary <>
Submitted on : Friday, September 12, 2014 - 1:27:51 PM
Last modification on : Friday, March 22, 2019 - 2:22:12 PM
Long-term archiving on: Saturday, December 13, 2014 - 10:40:14 AM

Files

LREtiger2paper_2014_09.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00612833, version 2
  • ARXIV : 1108.0631

Citation

Laurent Romary, Amir Zeldes, Florian Zipser. Serialising the ISO SynAF Syntactic Object Model. 2014. ⟨inria-00612833v2⟩

Share

Metrics

Record views

396

Files downloads

32