Skip to Main content Skip to Navigation
Conference papers

Growing TreeLex

Anna Kupść 1, 2, 3 Anne Abeillé 4, 5
3 SIGNES - Linguistic signs, grammar and meaning: computational logic for natural language
Université Sciences et Technologies - Bordeaux 1, Inria Bordeaux - Sud-Ouest, École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB), CNRS - Centre National de la Recherche Scientifique : UMR5800
Abstract : TreeLex is a subcategorization lexicon of French, automatically extracted from a syntactically annotated corpus. The lexicon comprises 2006 verbs (25076 occurrences). The goal of the project is to obtain a list of subcategorization frames of contemporary French verbs and to estimate the number of different verb frames available in French in general. A few more frames are discovered when the corpus size changes, but the average number of frames per verb remains relatively stable (about 1.91--2.09 frames per verb).
Document type :
Conference papers
Complete list of metadata

https://hal.inria.fr/inria-00338103
Contributor : Anna Kupsc <>
Submitted on : Monday, November 10, 2008 - 9:21:59 PM
Last modification on : Friday, July 16, 2021 - 4:56:02 PM

Identifiers

  • HAL Id : inria-00338103, version 1

Citation

Anna Kupść, Anne Abeillé. Growing TreeLex. 9th International Conference, CICLing 2008, Feb 2008, Haifa, Israel. pp.28--39. ⟨inria-00338103⟩

Share

Metrics

Record views

201