Enforcing Subcategorization Constraints in a Parser Using Sub-parses Recombining

Abstract: Treebanks are not large enough to adequately model the subcategorization frames of predicative lexemes, which are an important source of lexico-syntactic constraints for parsing. As a consequence, parsers trained on such treebanks usually make mistakes when selecting the arguments of predicative lexemes. In this paper, we propose an original way to correct subcategorization errors by combining sub-parses of a sentence S that appear in the list of the n-best parses of S. The subcategorization information comes from three different resources: the first is extracted from a treebank, the second is computed on a large corpus, and the third is an existing syntactic lexicon. Experiments on the French Treebank showed a 15.24% reduction in erroneous subcategorization frame (SF) selections for verbs, as well as a 4% relative decrease in the Labeled Accuracy Score error rate of the state-of-the-art parser on this treebank.
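
The idea of picking, for each verb, a sub-parse from the n-best list that satisfies a subcategorization lexicon can be illustrated with a small sketch. This is not the authors' algorithm: it only selects one frame per verb and does not rebuild a well-formed tree. The data layout, the label names (`suj`, `obj`, `a_obj`, `de_obj`, `mod`), and the functions `frame_of` and `select_frames` are assumptions made for illustration.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy representation, not taken from the paper.
Arc = Tuple[str, str, str]   # (head lemma, dependency label, dependent word)
Parse = List[Arc]            # one candidate parse of the sentence
Frame = Tuple[str, ...]      # sorted argument labels attached to one verb


def frame_of(parse: Parse, verb: str, arg_labels: Set[str]) -> Frame:
    """Collect the argument labels attached to `verb` in one parse."""
    return tuple(sorted(label for head, label, _ in parse
                        if head == verb and label in arg_labels))


def select_frames(nbest: List[Parse], verbs: List[str],
                  lexicon: Dict[str, Set[Frame]],
                  arg_labels: Set[str]) -> Dict[str, Tuple[int, Frame]]:
    """Keep the 1-best frame of each verb if the lexicon licenses it;
    otherwise use the highest-ranked parse whose frame is licensed.
    If no parse is licensed, the 1-best frame is kept unchanged."""
    chosen: Dict[str, Tuple[int, Frame]] = {}
    for verb in verbs:
        allowed = lexicon.get(verb, set())
        chosen[verb] = (0, frame_of(nbest[0], verb, arg_labels))
        for rank, parse in enumerate(nbest):
            frame = frame_of(parse, verb, arg_labels)
            if frame in allowed:
                chosen[verb] = (rank, frame)
                break
    return chosen


# Toy 2-best list: the 1-best parse attaches "Marie" to the noun "livre",
# the second parse attaches it to the verb "donner" as an indirect object.
nbest = [
    [("donner", "suj", "Jean"), ("donner", "obj", "livre"), ("livre", "mod", "Marie")],
    [("donner", "suj", "Jean"), ("donner", "obj", "livre"), ("donner", "a_obj", "Marie")],
]
lexicon = {"donner": {("a_obj", "obj", "suj")}}   # assumed lexicon entry
arg_labels = {"suj", "obj", "a_obj", "de_obj"}

result = select_frames(nbest, ["donner"], lexicon, arg_labels)
print(result)
```

In this toy case the frame found in the 1-best parse is not licensed by the assumed lexicon entry, so the sketch falls back to the sub-parse of the second-ranked parse for the verb.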
Document type:
Conference paper
NAACL 2013 - Conference of the North American Chapter of the Association for Computational Linguistics, Jun 2013, Atlanta, United States. 2013

https://hal.inria.fr/hal-00936492
Contributor: Benoît Sagot
Submitted on: Sunday, January 26, 2014 - 15:36:12
Last modified on: Saturday, June 9, 2018 - 10:30:06
Document(s) archived on: Saturday, April 26, 2014 - 22:15:52

File

N13-1024.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-00936492, version 1

Citation

Seyed Abolghasem Mirroshandel, Alexis Nasr, Benoît Sagot. Enforcing Subcategorization Constraints in a Parser Using Sub-parses Recombining. NAACL 2013 - Conference of the North American Chapter of the Association for Computational Linguistics, Jun 2013, Atlanta, United States. 2013. 〈hal-00936492〉
