Growing TreeLex

Anna Kupsc 1, 2, 3 Anne Abeillé 4, 5
3 SIGNES - Linguistic signs, grammar and meaning: computational logic for natural language
Université Sciences et Technologies - Bordeaux 1, Inria Bordeaux - Sud-Ouest, École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB), CNRS - Centre National de la Recherche Scientifique : UMR5800
Abstract : TreeLex is a subcategorization lexicon of French, automatically extracted from a syntactically annotated corpus. The lexicon comprises 2006 verbs (25076 occurrences). The goal of the project is to obtain a list of subcategorization frames of contemporary French verbs and to estimate the number of different verb frames available in French in general. A few more frames are discovered when the corpus size changes, but the average number of frames per verb remains relatively stable (about 1.91--2.09 frames per verb).
Type de document :
Communication dans un congrès
9th International Conference, CICLing 2008, Feb 2008, Haifa, Israel. pp.28--39, 2008
Liste complète des métadonnées

https://hal.inria.fr/inria-00338103
Contributeur : Anna Kupsc <>
Soumis le : lundi 10 novembre 2008 - 21:21:59
Dernière modification le : mercredi 23 mai 2018 - 17:58:03

Identifiants

  • HAL Id : inria-00338103, version 1

Citation

Anna Kupsc, Anne Abeillé. Growing TreeLex. 9th International Conference, CICLing 2008, Feb 2008, Haifa, Israel. pp.28--39, 2008. 〈inria-00338103〉

Partager

Métriques

Consultations de la notice

130