Skip to Main content Skip to Navigation
Conference papers

Growing TreeLex

Anna Kupść 1, 2, 3 Anne Abeillé 4, 5 
3 SIGNES - Linguistic signs, grammar and meaning: computational logic for natural language
Université Sciences et Technologies - Bordeaux 1, Inria Bordeaux - Sud-Ouest, École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB), CNRS - Centre National de la Recherche Scientifique : UMR5800
Abstract : TreeLex is a subcategorization lexicon of French, automatically extracted from a syntactically annotated corpus. The lexicon comprises 2006 verbs (25076 occurrences). The goal of the project is to obtain a list of subcategorization frames of contemporary French verbs and to estimate the number of different verb frames available in French in general. A few more frames are discovered when the corpus size changes, but the average number of frames per verb remains relatively stable (about 1.91--2.09 frames per verb).
Document type :
Conference papers
Complete list of metadata
Contributor : Anna Kupsc Connect in order to contact the contributor
Submitted on : Monday, November 10, 2008 - 9:21:59 PM
Last modification on : Sunday, June 26, 2022 - 4:42:46 AM


  • HAL Id : inria-00338103, version 1


Anna Kupść, Anne Abeillé. Growing TreeLex. 9th International Conference, CICLing 2008, Feb 2008, Haifa, Israel. pp.28--39. ⟨inria-00338103⟩



Record views