Extraction of Type-Logical Supertags from the Spoken Dutch Corpus - Inria - Institut national de recherche en sciences et technologies du numérique Access content directly
Book Sections Year : 2010

Extraction of Type-Logical Supertags from the Spoken Dutch Corpus

Abstract

The Spoken Dutch Corpus assigns 1 million of its 9 million total words a syn- tactic annotation in the form of dependency graphs. We will look at strategies for automatically extracting a lexicon of type-logical supertags from these dependency graphs and investigate how different levels of lexical detail affect the size of the resulting lexicon as well as the performance with respect to supertag disambiguation.
Fichier principal
Vignette du fichier
st.pdf (151.07 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

inria-00413347 , version 1 (03-09-2009)
inria-00413347 , version 2 (22-06-2010)

Identifiers

  • HAL Id : inria-00413347 , version 2

Cite

Richard Moot. Extraction of Type-Logical Supertags from the Spoken Dutch Corpus. Aravind Joshi and Srinivas Bangalore. Supertagging: Using Complex Lexical Descriptions in Natural Language Processing, MIT Press, 2010, ISBN-10: 0-262-01387-8 ISBN-13: 978-0-262-01387-1. ⟨inria-00413347v2⟩
260 View
298 Download

Share

Gmail Facebook X LinkedIn More