DP-LinkNet: A convolutional network for historical document image binarization

被引:42
|
作者
Xiong, Wei [1 ,2 ]
Jia, Xiuhong [1 ]
Yang, Dichun [1 ]
Ai, Meihui [1 ]
Li, Lirong [1 ]
Wang, Song [2 ]
机构
[1] Hubei Univ Technol, Sch Elect & Elect Engn, Wuhan 430068, Hubei, Peoples R China
[2] Univ South Carolina, Dept Comp Sci & Engn, Columbia, SC 29201 USA
基金
中国国家自然科学基金;
关键词
Degraded document image binarization; semantic segmentation; DP-LinkNet; encoder-decoder architecture; & nbsp; hybrid dilated convolution (HDC); spatial pyramid pooling (SPP); COMPETITION;
D O I
10.3837/tiis.2021.05.011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Document image binarization is an important pre-processing step in document analysis and archiving. The state-of-the-art models for document image binarization are variants of encoder-decoder architectures, such as FCN (fully convolutional network) and U-Net. Despite their success, they still suffer from three limitations: (1) reduced feature map resolution due to consecutive strided pooling or convolutions, (2) multiple scales of target objects, and (3) reduced localization accuracy due to the built-in invariance of deep convolutional neural networks (DCNNs). To overcome these three challenges, we propose an improved semantic segmentation model, referred to as DP-LinkNet, which adopts the D-LinkNet architecture as its backbone, with the proposed hybrid dilated convolution (HDC) and spatial pyramid pooling (SPP) modules between the encoder and the decoder. Extensive experiments are conducted on recent document image binarization competition (DIBCO) and handwritten document image binarization competition (H-DIBCO) benchmark datasets. Results show that our proposed DP-LinkNet outperforms other state-of-the-art techniques by a large margin. Our implementation and the pre-trained models are available at https://github.com/beargolden/DP-LinkNet.
引用
收藏
页码:1778 / 1797
页数:20
相关论文
共 50 条
  • [1] FD-Net: A Fully Dilated Convolutional Network for Historical Document Image Binarization
    Xiong, Wei
    Yue, Ling
    Zhou, Lei
    Wei, Liying
    Li, Min
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 518 - 529
  • [2] Historical document image binarization
    Mello, Carlos A. B.
    Oliveira, Adriano L. I.
    Sanchez, Angel
    VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2008, : 108 - 113
  • [3] Historical Document Image Binarization: A Review
    Tensmeyer C.
    Martinez T.
    SN Computer Science, 2020, 1 (3)
  • [4] Document Image Binarization with Fully Convolutional Neural Networks
    Tensmeyer, Chris
    Martinez, Tony
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 99 - 104
  • [5] Performance Evaluation Methodology for Historical Document Image Binarization
    Ntirogiannis, Konstantinos
    Gatos, Basilis
    Pratikakis, Ioannis
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (02) : 595 - 609
  • [6] Insights on the Use of Convolutional Neural Networks for Document Image Binarization
    Pastor-Pellicer, J.
    Espana-Boquera, S.
    Zamora-Martinez, F.
    Afzal, M. Zeshan
    Jose Castro-Bleda, Maria
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, PT II, 2015, 9095 : 115 - 126
  • [7] Using Convolutional Encoder-Decoder for Document Image Binarization
    Peng, Xujun
    Cao, Huaigu
    Natarajan, Prem
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 708 - 713
  • [8] Historical Document Image Binarization Based on Edge Contrast Information
    Li, Zhenjiang
    Wang, Weilan
    Cai, Zhengqi
    ADVANCES IN COMPUTER VISION, CVC, VOL 1, 2020, 943 : 614 - 628
  • [9] Restoration Based Contourlet Transform for Historical Document Image Binarization
    Zemouri, ET-Tahir
    Chibani, Youcef
    Brik, Youcef
    2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 309 - 313
  • [10] A hybrid CNN-Transformer model for Historical Document Image Binarization
    Rezanezhad, Vahid
    Baierer, Konstantin
    Neudecker, Clemens
    PROCEEDINGS OF THE 2023 INTERNATIONAL WORKSHOP ON HISTORICAL DOCUMENT IMAGING AND PROCESSING, HIP 2023, 2023, : 79 - 84