Skip to Main content Skip to Navigation
Journal articles

Un modèle neuro markovien profond pour l’extraction de séquences dans des documents manuscrits

Simon Thomas 1, 2 Clément Chatelain 1, 2 Thierry Paquet 1, 2 Laurent Heutte 1, 2 
2 DocApp - LITIS - Equipe Apprentissage
LITIS - Laboratoire d'Informatique, de Traitement de l'Information et des Systèmes
Abstract : In this paper, we propose a keyword extraction system able to extract keywords in handwritten documents. The base system rely on a HMM line model made of an Out-Of-KeyWord Vocabulary model and keywords model. In order to be more discriminant at the local level (the frame level), the standard gaussian mixture of the HMM are replaced by a deep neu-ral network (DNN) for computing the observations probabilities. Experimentations are carried out on an unconstrained handwritten document database used for the 2009 ICDAR handwriting recognition competitions. The results demonstrate the interest of the keyword extraction system as opposed to the sequential integration strategy of full text recognition prior to the detection of keywords. We also show the benefit from using the deep architecture instead of the gaussian mixtures.
Document type :
Journal articles
Complete list of metadata

Cited literature [27 references]  Display  Hide  Download

https://hal.inria.fr/hal-01105363
Contributor : Clément Chatelain Connect in order to contact the contributor
Submitted on : Tuesday, January 20, 2015 - 11:10:17 AM
Last modification on : Wednesday, March 2, 2022 - 10:10:10 AM
Long-term archiving on: : Tuesday, April 21, 2015 - 10:36:46 AM

File

docnum-thomas2013.pdf
Files produced by the author(s)

Identifiers

Citation

Simon Thomas, Clément Chatelain, Thierry Paquet, Laurent Heutte. Un modèle neuro markovien profond pour l’extraction de séquences dans des documents manuscrits. Document Numérique, Lavoisier, 2013, 16 (2), pp.20. ⟨10.3166/dn.16.2.49-68⟩. ⟨hal-01105363⟩

Share

Metrics

Record views

36

Files downloads

369