Recognition of Table of Contents for Electronic Library

Abdel Belaïd 1 Nabil Murshed
1 READ - READ
LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : A labeling approach for automatic recognition of Tables of Contents (ToC) is described in this paper. A prototype is used for electronic consulting of scientific papers in a digital library system named Calliope. This method operates on a roughly structured ASCII file, produced by OCR. The recognition approach operates by text labeling without using any a priori model. Labeling is based on a Part of Speech Tagging (PoS) which is initiated by a primary labeling of text component using some specific dictionaries.
Type de document :
Communication dans un congrès
4th International Workshop on Document Analysis Systems - DAS'2000, 2000, Rio de Janeiro, Brésil, 28 p, 2000
Liste complète des métadonnées

https://hal.inria.fr/inria-00099147
Contributeur : Publications Loria <>
Soumis le : mardi 26 septembre 2006 - 08:51:21
Dernière modification le : jeudi 11 janvier 2018 - 06:20:00

Identifiants

  • HAL Id : inria-00099147, version 1

Collections

Citation

Abdel Belaïd, Nabil Murshed. Recognition of Table of Contents for Electronic Library. 4th International Workshop on Document Analysis Systems - DAS'2000, 2000, Rio de Janeiro, Brésil, 28 p, 2000. 〈inria-00099147〉

Partager

Métriques

Consultations de la notice

49