Skip to Main content Skip to Navigation
Conference papers

Document Noise Removal using Sparse Representations over Learned Dictionary

Thanh Ha Do 1 Salvatore Tabbone 1, * Oriol Ramos Terrades 2
* Corresponding author
1 QGAR - Querying Graphics through Analysis and Recognition
LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : In this paper, we propose an algorithm for denoising document images using sparse representations. Following a training set, this algorithm is able to learn the main document characteristics and also, the kind of noise included into the documents. In this perspective, we propose to model the noise energy based on the normalized cross-correlation between pairs of noisy and non-noisy documents. Experimental results on several datasets demonstrate the robustness of our method compared with the state-of-the-art.
Document type :
Conference papers
Complete list of metadata

Cited literature [20 references]  Display  Hide  Download

https://hal.inria.fr/hal-00939174
Contributor : Thanh Ha Do <>
Submitted on : Thursday, February 13, 2014 - 8:52:03 AM
Last modification on : Monday, March 29, 2021 - 11:50:03 AM
Long-term archiving on: : Tuesday, May 13, 2014 - 10:11:09 PM

File

fp032-haPS.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Thanh Ha Do, Salvatore Tabbone, Oriol Ramos Terrades. Document Noise Removal using Sparse Representations over Learned Dictionary. The 13th ACM Symposium on Document Engineering, Sep 2013, Florence, Italy. ⟨10.1145/2494266.2494281⟩. ⟨hal-00939174⟩

Share

Metrics

Record views

227

Files downloads

692