Creating and maintaining language resources: the main guidelines of the Victoria project

Abstract : Many Natural Language Processing (NLP) tools rely on the availability of reliable language resources (LRs). Moreover, even when such LRs are available for a given language, their quality or coverage sometimes prevent them from being used in complex NLP systems. Considering the attention received from both the academic and industrial worlds and the significant efforts achieved during the past decades for LR development, such a lack of high quality and wide-coverage LR shows how difficult their creation and maintainance can be. In this paper, we describe a set of guidelines applied within the Victoria project in order to ease the creation and correction of the LRs required for symbolic parsing. These generic guidelines should be easy to adapt and use for the production of other types of LRs.
Document type :
Conference papers
Complete list of metadatas

Cited literature [9 references]  Display  Hide  Download

https://hal.inria.fr/inria-00521241
Contributor : Benoît Sagot <>
Submitted on : Sunday, September 26, 2010 - 10:36:27 PM
Last modification on : Thursday, August 29, 2019 - 2:24:09 PM
Long-term archiving on : Thursday, October 25, 2012 - 4:01:55 PM

File

victoria_lrec2010_final.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00521241, version 1

Citation

Lionel Nicolas, Miguel Molinero, Benoît Sagot, Nieves Fernández Formoso, Vanesa Vidal Castro. Creating and maintaining language resources: the main guidelines of the Victoria project. Workshop on Language Resources: From Storyboard to Sustainability and LR Lifecycle Management (LREC 2010 workshop), May 2010, Valletta, Malta. ⟨inria-00521241⟩

Share

Metrics

Record views

357

Files downloads

477