Speeding up corpus development for linguistic research: language documentation and acquisition in Romansh Tuatschin

Abstract : In this paper, we present ongoing work for developing language resources and basic NLP tools for an undocumented variety of Romansh, in the context of a language documentation and language acquisition project. Our tools are designed to improve the speed and reliability of corpus annotations for noisy data involving large amounts of code-switching, occurrences of child speech and orthographic noise. Being able to increase the efficiency of language resource development for language documentation and acquisition research also constitutes a step towards solving the data sparsity issues with which researchers have been struggling.
Type de document :
Communication dans un congrès
Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, Aug 2017, Vancouver, Canada. pp.89 - 94, Proceedings of the Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature. 〈https://sighum.wordpress.com/events/latech-clfl-2017/〉. 〈10.18653/v1/W17-2212〉
Liste complète des métadonnées

Littérature citée [11 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01570614
Contributeur : Benoît Sagot <>
Soumis le : lundi 31 juillet 2017 - 19:08:29
Dernière modification le : dimanche 25 février 2018 - 10:46:04

Fichier

speeding-corpus-development-10...
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Géraldine Walther, Benoît Sagot. Speeding up corpus development for linguistic research: language documentation and acquisition in Romansh Tuatschin. Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, Aug 2017, Vancouver, Canada. pp.89 - 94, Proceedings of the Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature. 〈https://sighum.wordpress.com/events/latech-clfl-2017/〉. 〈10.18653/v1/W17-2212〉. 〈hal-01570614〉

Partager

Métriques

Consultations de la notice

137

Téléchargements de fichiers

27