Degraded Document Image Binarization using Novel Background Estimation Technique

被引:0
|
作者
Jindal, Harshit [1 ]
Kumar, Manoj [1 ]
Tomar, Akhil [1 ]
Malik, Ayush [1 ]
机构
[1] Delhi Technol Univ, Dept Comp Sci Engn, New Delhi, India
关键词
Document Image Processing; Degraded Document Image Binarization; Thresholding; Background estimation; Noise Removal; Otsu Thresholding; Bilateral Filtering;
D O I
10.1109/I2CT51068.2021.9418084
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Over the past few decades, the use of scanned historical document images has increased dramatically, especially with the emergence of online libraries and standard benchmark datasets like DIBCO. The historical documents are usually in very-poor conditions containing noises like large ink stains, bleed-through, liquid spills, uneven-background, spots, faded-ink, weak/thin text that makes the task of binarization very difficult. In this paper, we propose an effective degraded document image binarization algorithm that performs accurate text segmentation. Our method first estimates the background utilizing information from neighboring pixels and filter smoothening. The next step is background subtraction that helps in the compensation of background distortions. The document is segmented using Otsu thresholding, and then we process the image to remove the remaining noise and maximize text content using labelled connected components. Our method outperforms several existing and widely used binarization algorithms on F-measure, PSNR, DRD, and pseudo F-measure when evaluated on H-DIBCO 2016 and H-DIBCO 2018 datasets and can very effectively detect faint characters from a document image.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Degraded document image binarization based on combination of two complementary algorithms
    Valizadeh, M.
    Komeili, M.
    Armanfard, N.
    Kabir, E.
    2009 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTATIONAL TOOLS FOR ENGINEERING APPLICATIONS, 2009, : 596 - 600
  • [32] MULTI-SPECTRAL DOCUMENT IMAGE BINARIZATION USING IMAGE FUSION AND BACKGROUND SUBTRACTION TECHNIQUES
    Mitianoudis, Nikolaos
    Papamarkos, Nikolaos
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 5172 - 5176
  • [33] Deep Networks for Degraded Document Image Binarization through Pyramid Reconstruction
    Meng, Gaofeng
    Yuan, Kun
    Wu, Ying
    Xiang, Shiming
    Pan, Chunhong
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 727 - 732
  • [34] Binarization of degraded document image based on feature space partitioning and classification
    Morteza Valizadeh
    Ehsanollah Kabir
    International Journal on Document Analysis and Recognition (IJDAR), 2012, 15 : 57 - 69
  • [35] Binarization of degraded document image based on feature space partitioning and classification
    Valizadeh, Morteza
    Kabir, Ehsanollah
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2012, 15 (01) : 57 - 69
  • [36] GiB: A Game Theory Inspired Binarization Technique for Degraded Document Images
    Bhowmik, Showmik
    Sarkar, Ram
    Das, Bishwadeep
    Doermann, David
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (03) : 1443 - 1455
  • [37] Degraded historical document image binarization using local features and support vector machine (SVM)
    Xiong, Wei
    Xu, Jingjing
    Xiong, Zijie
    Wang, Juan
    Liu, Min
    OPTIK, 2018, 164 : 218 - 223
  • [38] Broken and degraded document images binarization
    Chen, Yiping
    Wang, Liansheng
    NEUROCOMPUTING, 2017, 237 : 272 - 280
  • [39] A Novel Degraded Document Binarization Model through Vision Transformer Network
    Yang, Mingming
    Xu, Songhua
    INFORMATION FUSION, 2023, 93 : 159 - 173
  • [40] Gabor filter-based texture for ancient degraded document image binarization
    Sehad, Abdenour
    Chibani, Youcef
    Hedjam, Rachid
    Cheriet, Mohamed
    PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (01) : 1 - 22