Modelling frequency data -- Methodological considerations on the relationship between dictionaries and corpora

Abstract : The research questions addressed in our paper stem from a bundle of linguistically focused projects which -among other activities- also create glossaries and dictionaries which are intended to be usable both for human readers and particular NLP applications. The paper will comprise two parts: in the first section, the authors will give a concise overview of the projects and their goals. The second part will concentrate on encoding issues involved in the related dictionary production. Particular focus will be put on the modelling of an encoding scheme for statistical information on lexicographic data gleaned from digital corpora.
Type de document :
Communication dans un congrès
TEI Conference 2013, Oct 2013, Roma, Italy. 2013
Liste complète des métadonnées

Littérature citée [6 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00922068
Contributeur : Laurent Romary <>
Soumis le : lundi 23 décembre 2013 - 13:58:48
Dernière modification le : vendredi 3 novembre 2017 - 08:24:01
Document(s) archivé(s) le : lundi 24 mars 2014 - 00:35:08

Fichiers

tei_rome_abstract.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00922068, version 1

Collections

Citation

Gerhard Budin, Karlheinz Mörth, Laurent Romary. Modelling frequency data -- Methodological considerations on the relationship between dictionaries and corpora. TEI Conference 2013, Oct 2013, Roma, Italy. 2013. 〈hal-00922068〉

Partager

Métriques

Consultations de la notice

423

Téléchargements de fichiers

373