Conference papers

Context Processing to Read Text on Damaged Wooden Tablets

Abstract : This paper describes context processing that presents candidate readings for damaged script on wooden tablets (mokkans). Because mokkans excavated from old strata are damaged, even archaeologists can hardly read the script on them. Very often the ink in several areas has faded or is completely lost, and some characters may be misrecognized even though the reading of other characters depends on them. The context processing extends the Aho-Corasick method to allow self-transitions and presents candidates even for script with lost ink and misrecognized characters. For evaluation, we used 4,041 place names of 8th-century Japan as the vocabulary. Each place name consists of 9 to 11 characters. Test keywords were prepared with 1 to 6 characters lost and 0 to 2 characters replaced by other characters from the vocabulary. Even for keywords with 5 characters lost and 1 character replaced, the method places the correct name among the top 10 candidates in 71.7% of cases.
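
The idea of proposing vocabulary entries for a keyword with lost and misrecognized characters can be illustrated with a minimal sketch. It is not the authors' extended Aho-Corasick automaton: it swaps in a plain trie search, and it assumes that each lost character occupies exactly one known position (marked `?`) and that at most `max_subs` legible characters were misread. The names `build_trie`, `candidates`, `max_subs`, and the toy vocabulary are hypothetical and chosen only for illustration.

```python
def build_trie(words):
    """Build a nested-dict trie; the key None marks the end of a word."""
    root = {}
    for word in words:
        node = root
        for ch in word:
            node = node.setdefault(ch, {})
        node[None] = word  # store the full word at its terminal node
    return root


def candidates(trie, query, max_subs=1, lost="?"):
    """Return vocabulary words compatible with a damaged query.

    Assumptions (not taken from the paper): `lost` stands for exactly one
    unreadable character, and up to `max_subs` readable characters may
    have been misrecognized. Results are ordered by the number of
    substitutions needed, then alphabetically.
    """
    found = []

    def walk(node, i, subs):
        if i == len(query):
            if None in node:
                found.append((subs, node[None]))
            return
        q = query[i]
        for ch, child in node.items():
            if ch is None:
                continue
            if q == lost or ch == q:
                walk(child, i + 1, subs)        # lost ink or exact match
            elif subs < max_subs:
                walk(child, i + 1, subs + 1)    # treat q as misrecognized

    walk(trie, 0, 0)
    return [word for _, word in sorted(found)]


# Toy vocabulary of placeholder strings, not the paper's 4,041 place names.
vocab = build_trie(["yamato", "yamada", "yamasiro", "kawachi"])
print(candidates(vocab, "ya?ato"))               # one lost character
print(candidates(vocab, "ya?adx", max_subs=1))   # one lost, one misread
```

A real system along the lines of the abstract would also have to rank candidates when the number of lost characters is uncertain, which this sketch does not attempt.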

https://hal.inria.fr/inria-00104895
Contributor : Anne Jaigu
Submitted on : Monday, October 9, 2006 - 4:23:01 PM
Last modification on : Wednesday, February 10, 2021 - 1:12:01 PM
Long-term archiving on : Tuesday, April 6, 2010 - 5:41:36 PM

Identifiers

  • HAL Id : inria-00104895, version 1

Citation

Akihito Kitadai, Kazu Nishijima, Kei Saito, Masaki Nakagawa, Hajime Baba, et al. Context Processing to Read Text on Damaged Wooden Tablets. Tenth International Workshop on Frontiers in Handwriting Recognition, Université de Rennes 1, Oct 2006, La Baule (France). ⟨inria-00104895⟩