GROBID for Humanities When engineering meets History

Abstract : In this presentation we explore the relationship between humanists and computer scientists and the crucial need of scientific crossover, based on our experience in the interdisciplinary team ALMAnaCH at Inria, which gathers people with very different backgrounds. We focus on the use and development of the GROBID suite. GROBID is a tool initially built for extracting metadata from scientific articles. Over the years it has evolved with new features and been used in new domains (for example archival documents), with the help of specialists in the field concerned. In our example, we use it to identify mentions corresponding to the actors of armed conflicts in historical personal diaries. This is a Named Entity Recognition task made more complex by the specialized terminology due to the period (Second world war) and the presence of constraints of writing (clandestinity).
Type de document :
Communication dans un congrès
Text as a Resource. Text Mining in Historical Science, Jun 2017, Paris, France
Liste complète des métadonnées

Littérature citée [1 références]  Voir  Masquer  Télécharger
Contributeur : Charles Riondet <>
Soumis le : lundi 11 septembre 2017 - 18:46:12
Dernière modification le : jeudi 26 avril 2018 - 10:28:18


GROBID for Humanities- when en...
Fichiers produits par l'(les) auteur(s)


Distributed under a Creative Commons Paternité 4.0 International License


  • HAL Id : hal-01585693, version 1



Charles Riondet, Luca Foppiano. GROBID for Humanities When engineering meets History. Text as a Resource. Text Mining in Historical Science, Jun 2017, Paris, France. 〈hal-01585693〉



Consultations de la notice


Téléchargements de fichiers