Creating and maintaining language resources: the main guidelines of the Victoria project

Abstract : Many Natural Language Processing (NLP) tools rely on the availability of reliable language resources (LRs). Moreover, even when such LRs are available for a given language, their quality or coverage sometimes prevent them from being used in complex NLP systems. Considering the attention received from both the academic and industrial worlds and the significant efforts achieved during the past decades for LR development, such a lack of high quality and wide-coverage LR shows how difficult their creation and maintainance can be. In this paper, we describe a set of guidelines applied within the Victoria project in order to ease the creation and correction of the LRs required for symbolic parsing. These generic guidelines should be easy to adapt and use for the production of other types of LRs.
Type de document :
Communication dans un congrès
Workshop on Language Resources: From Storyboard to Sustainability and LR Lifecycle Management (LREC 2010 workshop), May 2010, Valletta, Malta. 2010
Liste complète des métadonnées


https://hal.inria.fr/inria-00521241
Contributeur : Benoît Sagot <>
Soumis le : dimanche 26 septembre 2010 - 22:36:27
Dernière modification le : mercredi 12 octobre 2016 - 01:23:18
Document(s) archivé(s) le : jeudi 25 octobre 2012 - 16:01:55

Fichier

victoria_lrec2010_final.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00521241, version 1

Collections

Citation

Lionel Nicolas, Miguel Molinero, Benoît Sagot, Nieves Fernández Formoso, Vanesa Vidal Castro. Creating and maintaining language resources: the main guidelines of the Victoria project. Workshop on Language Resources: From Storyboard to Sustainability and LR Lifecycle Management (LREC 2010 workshop), May 2010, Valletta, Malta. 2010. <inria-00521241>

Partager

Métriques

Consultations de
la notice

268

Téléchargements du document

308