A Registry of Standard Data Categories for Linguistic Annotation

Nancy Ide Laurent Romary 1
1 LANGUE ET DIALOGUE - Human-machine dialogue with a significant language component
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : In this paper we describe the most recent work within ISO TC37/SC 4, and in particular the development of a Data Category Registry (DCR) component of the Linguistic Annotation Framework. The DCR will contain a formally defined set of linguistic categories in common use within the language engineering community for reference and use in linguistically annotated resources. We outline the first proposals for creation and management of the DCR, as a solicitation for input from the community.
Document type :
Conference papers
Complete list of metadatas

https://hal.inria.fr/inria-00099858
Contributor : Laurent Romary <>
Submitted on : Tuesday, January 13, 2009 - 11:08:01 AM
Last modification on : Monday, April 8, 2019 - 10:24:04 AM
Long-term archiving on : Saturday, May 14, 2011 - 12:04:16 AM

Identifiers

  • HAL Id : inria-00099858, version 1

Collections

Citation

Nancy Ide, Laurent Romary. A Registry of Standard Data Categories for Linguistic Annotation. 4th International Conference on Language Resources and Evaluation - LREC'04, 2004, Lisbonne, Portugal, pp.135-138. ⟨inria-00099858⟩

Share

Metrics

Record views

497

Files downloads

140