DP-LinkNet: A convolutional network for historical document image binarization

被引:42
|
作者
Xiong, Wei [1 ,2 ]
Jia, Xiuhong [1 ]
Yang, Dichun [1 ]
Ai, Meihui [1 ]
Li, Lirong [1 ]
Wang, Song [2 ]
机构
[1] Hubei Univ Technol, Sch Elect & Elect Engn, Wuhan 430068, Hubei, Peoples R China
[2] Univ South Carolina, Dept Comp Sci & Engn, Columbia, SC 29201 USA
基金
中国国家自然科学基金;
关键词
Degraded document image binarization; semantic segmentation; DP-LinkNet; encoder-decoder architecture; & nbsp; hybrid dilated convolution (HDC); spatial pyramid pooling (SPP); COMPETITION;
D O I
10.3837/tiis.2021.05.011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Document image binarization is an important pre-processing step in document analysis and archiving. The state-of-the-art models for document image binarization are variants of encoder-decoder architectures, such as FCN (fully convolutional network) and U-Net. Despite their success, they still suffer from three limitations: (1) reduced feature map resolution due to consecutive strided pooling or convolutions, (2) multiple scales of target objects, and (3) reduced localization accuracy due to the built-in invariance of deep convolutional neural networks (DCNNs). To overcome these three challenges, we propose an improved semantic segmentation model, referred to as DP-LinkNet, which adopts the D-LinkNet architecture as its backbone, with the proposed hybrid dilated convolution (HDC) and spatial pyramid pooling (SPP) modules between the encoder and the decoder. Extensive experiments are conducted on recent document image binarization competition (DIBCO) and handwritten document image binarization competition (H-DIBCO) benchmark datasets. Results show that our proposed DP-LinkNet outperforms other state-of-the-art techniques by a large margin. Our implementation and the pre-trained models are available at https://github.com/beargolden/DP-LinkNet.
引用
收藏
页码:1778 / 1797
页数:20
相关论文
共 50 条
  • [42] Robust Document Image Binarization Technique for Degraded Document Images
    Su, Bolan
    Lu, Shijian
    Tan, Chew Lim
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (04) : 1408 - 1417
  • [43] OCR oriented binarization method of document image
    Yang, You
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 4, PROCEEDINGS, 2008, : 622 - 625
  • [44] Gabor Filters for Degraded Document Image Binarization
    Sehad, Abdenour
    Chibani, Youcef
    Cheriet, Mohamed
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 702 - 707
  • [45] LEARNING DOCUMENT IMAGE BINARIZATION FROM DATA
    Wu, Yue
    Natarajan, Premkumar
    Rawls, Stephen
    AbdAlmageed, Wael
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 3763 - 3767
  • [46] Document image binarization based on texture features
    Liu, Y
    Srihari, SN
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (05) : 540 - 544
  • [47] DIBCO 2009: document image binarization contest
    Gatos, B.
    Ntirogiannis, K.
    Pratikakis, I.
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2011, 14 (01) : 35 - 44
  • [48] An Improved Adaptive Document Image Binarization Method
    Zhou, Shuang-fei
    Liu, Chun-ping
    Cui, Zhi-ming
    Gong, Sheng-rong
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 1738 - 1742
  • [49] Degraded historical document image binarization using local features and support vector machine (SVM)
    Xiong, Wei
    Xu, Jingjing
    Xiong, Zijie
    Wang, Juan
    Liu, Min
    OPTIK, 2018, 164 : 218 - 223
  • [50] Document image binarization based on texture features
    State Univ of New York at Buffalo, Buffalo, United States
    IEEE Trans Pattern Anal Mach Intell, 5 (540-544):