Conference Papers

TEI Encoding of a Classical Mixtec Dictionary Using GROBID-Dictionaries

Abstract

This paper presents the application of GROBID-Dictionaries (Khemakhem et al. 2017, 2018a, 2018b, 2018c), an open-source machine learning system for automatically structuring digitized print dictionaries into TEI (Text Encoding Initiative), to a historical lexical resource of Colonial Mixtec, 'Voces del Dzaha Dzahui', published by the Dominican fray Francisco Alvarado in 1593. GROBID-Dictionaries was applied to a reorganized and modernized version of this resource published by Jansen and Pérez Jiménez (2009). The resulting TEI dictionary will be integrated into a language documentation project on Mixtepec-Mixtec (ISO 639-3: mix) (Bowers & Romary, 2017, 2018a, 2018b), an under-resourced indigenous language native to the Juxtlahuaca district of Oaxaca, Mexico.
Main file: eLex_2019_abstract_111.pdf (1.04 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-02264033, version 1 (06-08-2019)

Licence

Attribution - CC BY 4.0

Identifiers

  • HAL Id: hal-02264033, version 1

Cite

Jack Bowers, Mohamed Khemakhem, Laurent Romary. TEI Encoding of a Classical Mixtec Dictionary Using GROBID-Dictionaries. ELEX 2019: Smart Lexicography, Oct 2019, Sintra, Portugal. ⟨hal-02264033⟩