HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

Context Processing to Read Text on Damaged Wooden Tablets

Abstract : This paper describes context processing to present candidates for damaged scripts on wooden tablets (mokkans). Since mokkans excavated from old strata have been damaged, even archeologists can hardly read scripts on mokkans. Very often, ink in several areas are faded out or completely lost, some characters might be misrecognized based on which other characters must be read. The context processing extends the Aho-Corasick method to allow self-transition and presents candidates even for scripts with lost ink and misrecognized characters. For evaluation, we employed 4,041 place names in Japan at the 8th century as the vocabulary. Each place name consists of 9 to 11 characters. Test keywords were prepared with 1 to 6 characters lost and 0 to 2 characters replaced by others from the vocabulary. Even for those with 5 characters lost and one character is replaced, the method nominates correct names in the top 10 candidates with 71.7% correctness.
Complete list of metadata

Cited literature [7 references]  Display  Hide  Download

Contributor : Anne Jaigu Connect in order to contact the contributor
Submitted on : Monday, October 9, 2006 - 4:23:01 PM
Last modification on : Wednesday, February 10, 2021 - 1:12:01 PM
Long-term archiving on: : Tuesday, April 6, 2010 - 5:41:36 PM


  • HAL Id : inria-00104895, version 1



Akihito Kitadai, Kazu Nishijima, Kei Saito, Masaki Nakagawa, Hajime Baba, et al.. Context Processing to Read Text on Damaged Wooden Tablets. Tenth International Workshop on Frontiers in Handwriting Recognition, Université de Rennes 1, Oct 2006, La Baule (France). ⟨inria-00104895⟩



Record views


Files downloads