Conference papers

TreeLex: A Subcategorisation Lexicon for French Verbs

Anna Kupsc (1, 2, 3, *), Anne Abeillé (4, 5)
* Corresponding author
3 SIGNES - Linguistic signs, grammar and meaning: computational logic for natural language
Université Sciences et Technologies - Bordeaux 1, Inria Bordeaux - Sud-Ouest, École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB), CNRS - Centre National de la Recherche Scientifique : UMR5800
Abstract : TreeLex is a subcategorization lexicon of French verbs, automatically extracted from a syntactically annotated corpus. The lexicon comprises 1362 verbs (12353 occurrences). Beyond listing the verbs with their subcategorization frames, we estimate the number of distinct verb frames available in French in general, as well as the average number of frames per verb. After applying various factorization techniques, we obtain 58 frames for a function-based representation (1.72 frames per verb on average) and 160 frames for a richer representation based on function-category information (1.91 frames per verb on average).
Keywords : valence, verbs, treebank
Document type : Conference papers

https://hal.inria.fr/inria-00338102
Contributor : Anna Kupsc
Submitted on : Monday, November 10, 2008 - 9:15:03 PM
Last modification on : Saturday, June 19, 2021 - 4:10:51 AM

Identifiers

  • HAL Id : inria-00338102, version 1

Citation

Anna Kupsc, Anne Abeillé. TreeLex: A Subcategorisation Lexicon for French Verbs. First International Conference on Global Interoperability for Language Resources, Jan 2008, Hong Kong, Hong Kong SAR China. ⟨inria-00338102⟩
