A Registry of Standard Data Categories for Linguistic Annotation

Nancy Ide Laurent Romary 1
1 LANGUE ET DIALOGUE - Human-machine dialogue with a significant language component
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : In this paper we describe the most recent work within ISO TC37/SC 4, and in particular the development of a Data Category Registry (DCR) component of the Linguistic Annotation Framework. The DCR will contain a formally defined set of linguistic categories in common use within the language engineering community for reference and use in linguistically annotated resources. We outline the first proposals for creation and management of the DCR, as a solicitation for input from the community.
Type de document :
Communication dans un congrès
4th International Conference on Language Resources and Evaluation - LREC'04, 2004, Lisbonne, Portugal, pp.135-138, 2004
Liste complète des métadonnées

https://hal.inria.fr/inria-00099858
Contributeur : Laurent Romary <>
Soumis le : mardi 13 janvier 2009 - 11:08:01
Dernière modification le : jeudi 11 janvier 2018 - 06:19:48
Document(s) archivé(s) le : samedi 14 mai 2011 - 00:04:16

Fichiers

Identifiants

  • HAL Id : inria-00099858, version 1

Collections

Citation

Nancy Ide, Laurent Romary. A Registry of Standard Data Categories for Linguistic Annotation. 4th International Conference on Language Resources and Evaluation - LREC'04, 2004, Lisbonne, Portugal, pp.135-138, 2004. 〈inria-00099858〉

Partager

Métriques

Consultations de la notice

347

Téléchargements de fichiers

94