Information Density Based Image Binarization for Text Document Containing Graphics

Abstract : In this work, a new clustering based binarization technique has been proposed. Clustering is done depending on the information density of the input image. Here input image is considered as a set of text, images as foreground and some random noises, marks of ink, spots of oil, etc. in the background. It is often quite difficult to separate the foreground from the background based on existing binarization technique. The existing methods offer good result if the input image contains only text. Experimental results indicate that this method is particularly good for degraded text document containing graphic images as well. USC-SIPI database is used for testing phase. It is compared with iterative partitioning, Otsu’s method for seven different metrics.
Liste complète des métadonnées

Cited literature [20 references]  Display  Hide  Download

https://hal.inria.fr/hal-01637461
Contributor : Hal Ifip <>
Submitted on : Friday, November 17, 2017 - 3:43:28 PM
Last modification on : Saturday, November 18, 2017 - 1:16:36 AM
Document(s) archivé(s) le : Sunday, February 18, 2018 - 2:47:23 PM

File

419526_1_En_10_Chapter.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Collections

Citation

Soma Datta, Nabendu Chaki, Sankhayan Choudhury. Information Density Based Image Binarization for Text Document Containing Graphics. Khalid Saeed; Władysław Homenda. 15th IFIP International Conference on Computer Information Systems and Industrial Management (CISIM), Sep 2016, Vilnius, Lithuania. Springer International Publishing, Lecture Notes in Computer Science, LNCS-9842, pp.105-115, 2016, Computer Information Systems and Industrial Management. 〈10.1007/978-3-319-45378-1_10〉. 〈hal-01637461〉

Share

Metrics

Record views

67

Files downloads

11