Journal article in Procesamiento del Lenguaje Natural. Year: 2016

Universal Dependencies for the AnCora treebanks

Abstract

This article describes the conversion of the Catalan and Spanish AnCora treebanks to the Universal Dependencies formalism. We describe the conversion process and assess the quality of the resulting treebanks in terms of parsing accuracy, using monolingual, cross-lingual and cross-domain parsing evaluation. The converted treebanks show an internal consistency comparable to that of the original CoNLL09 distribution of AnCora, and reveal some differences in multiword expression inventory with respect to the existing UD Spanish treebank. The two converted treebanks will be released in version 1.3 of Universal Dependencies.
Main file
5341-4677-1-PB.pdf (272.13 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01426751, version 1 (4 January 2017)

Identifiers

  • HAL Id: hal-01426751, version 1

Cite

Hector Martinez Alonso, Daniel Zeman. Universal Dependencies for the AnCora treebanks. Procesamiento del Lenguaje Natural, 2016, 57. ⟨hal-01426751⟩