TreeLex: A Subcategorisation Lexicon for French Verbs

Anna Kupsc 1, 2, 3, * Anne Abeillé 4, 5
* Auteur correspondant
3 SIGNES - Linguistic signs, grammar and meaning: computational logic for natural language
Université Sciences et Technologies - Bordeaux 1, Inria Bordeaux - Sud-Ouest, École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB), CNRS - Centre National de la Recherche Scientifique : UMR5800
Abstract : TreeLex is a subcategorization lexicon of French verbs, automatically extracted from a syntactically annotated corpus. The lexicon comprises 1362 verbs (12353 occurrences). We present not only a list of verbs with their subcategorization frames but we also estimate the number of different verb frames available in French in general. Additionally, we estimate the average number of frames per verb. After applying various factorization techniques, we obtain 58 frames for a function-based representation (on average, 1.72 frames per verb), and 160 frames for a richer representation based on function-category information (on average, 1.91 frames per verb).
keyword : valence verbs treebank
Type de document :
Communication dans un congrès
First International Conference on Global Interoperability for Language Resources, Jan 2008, Hong Kong, Hong Kong SAR China. 2008
Liste complète des métadonnées

https://hal.inria.fr/inria-00338102
Contributeur : Anna Kupsc <>
Soumis le : lundi 10 novembre 2008 - 21:15:03
Dernière modification le : jeudi 11 janvier 2018 - 06:22:13

Identifiants

  • HAL Id : inria-00338102, version 1

Citation

Anna Kupsc, Anne Abeillé. TreeLex: A Subcategorisation Lexicon for French Verbs. First International Conference on Global Interoperability for Language Resources, Jan 2008, Hong Kong, Hong Kong SAR China. 2008. 〈inria-00338102〉

Partager

Métriques

Consultations de la notice

129