Re-Typograph Phase I: a Proof-of-Concept for Typeface Parameter Extraction from Historical Documents

Abstract : This paper reports on the first phase of an attempt to create a full retro-engineering pipeline that aims to construct a complete set of coherent typographic parameters defining the typefaces used in a printed homogenous text. It should be stressed that this process cannot reasonably be expected to be fully automatic and that it is designed to include human interaction. Although font design is governed by a set of quite robust and formal geometric rulesets, it still heavily relies on subjective human interpretation. Furthermore, different parameters, applied to the generic rulesets may actually result in quite similar and visually difficult to distinguish typefaces, making the retro-engineering an inverse problem that is ill conditioned once shape distortions (related to the printing and/or scanning process) come into play. This work is the first phase of a long iterative process, in which we will progressively study and assess the techniques from the state-of-the-art that are most suited to our problem and investigate new directions when they prove to not quite adequate. As a first step, this is more of a feasibility proof-of-concept, that will allow us to clearly pinpoint the items that will require more in-depth research over the next iterations.
Type de document :
Communication dans un congrès
Bart Lamiroy; Eric Ringger. Document Recognition and Retrieval XXII, Feb 2015, San Francisco, United States. SPIE, 2015, Electronic Imaging. 〈http://spie.org/EI/conferencedetails/document-recognition-retrieval〉
Liste complète des métadonnées

Littérature citée [24 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01086803
Contributeur : Bart Lamiroy <>
Soumis le : lundi 24 novembre 2014 - 23:34:18
Dernière modification le : jeudi 11 janvier 2018 - 06:25:25
Document(s) archivé(s) le : mercredi 25 février 2015 - 11:51:18

Fichier

paper.pdf
Fichiers produits par l'(les) auteur(s)

Licence


Distributed under a Creative Commons Paternité - Pas d'utilisation commerciale 4.0 International License

Identifiants

  • HAL Id : hal-01086803, version 1

Collections

Citation

Bart Lamiroy, Thomas Bouville, Julien Blégean, Hongliu Cao, Salah Ghamizi, et al.. Re-Typograph Phase I: a Proof-of-Concept for Typeface Parameter Extraction from Historical Documents. Bart Lamiroy; Eric Ringger. Document Recognition and Retrieval XXII, Feb 2015, San Francisco, United States. SPIE, 2015, Electronic Imaging. 〈http://spie.org/EI/conferencedetails/document-recognition-retrieval〉. 〈hal-01086803〉

Partager

Métriques

Consultations de la notice

180

Téléchargements de fichiers

351