Language Documentation and Standards in Digital Humanities: TEI and the documentation of Mixtepec-Mixtec - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2019

Language Documentation and Standards in Digital Humanities: TEI and the documentation of Mixtepec-Mixtec

Résumé

This project concerns an ongoing language documentation project covering the Mixtepec-Mixtec variety of Mixtec (iso 639-3: mix). Mixtepec-Mixtec is an Otomonguean spoken by roughly 9000-10000 people in the Juxtlahuaca district of Oaxaca, and parts of the Guerrerro and Puebla states of Mexico as well as by communities living in California, Oregon, Washington and Arkansas. Among the primary facets of the work are to: create an open source body of reusable and extensible multimedia language resources encoded in TEI XML; create multi-lingual translations (English and Spanish), annotate the content according to sound theoretical linguistic principles; use the above in order to further the knowledge of all aspects of the language itself within the fields of linguistics and lexicography by producing empirical corpus-based descriptions and analyses of various aspects of the language’s features; demonstrate and evaluate the application of encoding and description standards on a collection of lexical and knowledge resources for an under-resourced non-indo-european language. In addition to providing a lasting and reusable set of resources for the MIX language, this work also aims to make strides towards bridging the gap between lexicography, language documentation, theoretical linguistics, computational linguistics and digital humanities.
Fichier principal
Vignette du fichier
Mixtepec-Mixtec-Documentation-PhD-WorkingVersion-long-20180306.pdf (97.14 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02004005 , version 1 (01-02-2019)
hal-02004005 , version 2 (20-02-2019)

Identifiants

  • HAL Id : hal-02004005 , version 2

Citer

Jack Bowers. Language Documentation and Standards in Digital Humanities: TEI and the documentation of Mixtepec-Mixtec. 2019. ⟨hal-02004005v2⟩
311 Consultations
216 Téléchargements

Partager

Gmail Facebook X LinkedIn More