DP-LinkNet: A convolutional network for historical document image binarization

被引:42
|
作者
Xiong, Wei [1 ,2 ]
Jia, Xiuhong [1 ]
Yang, Dichun [1 ]
Ai, Meihui [1 ]
Li, Lirong [1 ]
Wang, Song [2 ]
机构
[1] Hubei Univ Technol, Sch Elect & Elect Engn, Wuhan 430068, Hubei, Peoples R China
[2] Univ South Carolina, Dept Comp Sci & Engn, Columbia, SC 29201 USA
基金
中国国家自然科学基金;
关键词
Degraded document image binarization; semantic segmentation; DP-LinkNet; encoder-decoder architecture; & nbsp; hybrid dilated convolution (HDC); spatial pyramid pooling (SPP); COMPETITION;
D O I
10.3837/tiis.2021.05.011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Document image binarization is an important pre-processing step in document analysis and archiving. The state-of-the-art models for document image binarization are variants of encoder-decoder architectures, such as FCN (fully convolutional network) and U-Net. Despite their success, they still suffer from three limitations: (1) reduced feature map resolution due to consecutive strided pooling or convolutions, (2) multiple scales of target objects, and (3) reduced localization accuracy due to the built-in invariance of deep convolutional neural networks (DCNNs). To overcome these three challenges, we propose an improved semantic segmentation model, referred to as DP-LinkNet, which adopts the D-LinkNet architecture as its backbone, with the proposed hybrid dilated convolution (HDC) and spatial pyramid pooling (SPP) modules between the encoder and the decoder. Extensive experiments are conducted on recent document image binarization competition (DIBCO) and handwritten document image binarization competition (H-DIBCO) benchmark datasets. Results show that our proposed DP-LinkNet outperforms other state-of-the-art techniques by a large margin. Our implementation and the pre-trained models are available at https://github.com/beargolden/DP-LinkNet.
引用
收藏
页码:1778 / 1797
页数:20
相关论文
共 50 条
  • [11] Histogram Peak Ratio-Based Binarization for Historical Document Image
    Mahastama, Aditya W.
    Krisnawati, Lucia D.
    PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON SMART CITIES, AUTOMATION & INTELLIGENT COMPUTING SYSTEMS (ICON-SONICS 2017), 2017, : 93 - 98
  • [12] Historical document image binarization using background estimation and energy minimization
    Xiong, Wei
    Jia, Xiuhong
    Xu, Jingjing
    Xiong, Zijie
    Liu, Min
    Wang, Juan
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3716 - 3721
  • [13] Histogram peak ratio-based binarization for historical document image
    Information Technology Dept., Duta Wacana Christian University, Yogyakarta, Indonesia
    Proc. Int. Conf. Smart Cities, Autom. Intell. Comput. Syst., ICON-SONICS, 1600, (93-98):
  • [14] Historical document image binarization via style augmentation and atrous convolutions
    Rasyidi, Hanif
    Khan, Salman
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (13): : 7339 - 7352
  • [15] Historical document image binarization via style augmentation and atrous convolutions
    Hanif Rasyidi
    Salman Khan
    Neural Computing and Applications, 2021, 33 : 7339 - 7352
  • [16] Document Image Binarization With Stroke Boundary Feature Guided Network
    Dang, Quang-Vinh
    Lee, Guee-Sang
    IEEE ACCESS, 2021, 9 : 36924 - 36936
  • [17] Document Image Binarization Process
    Prodan, Marcel
    Boiangiu, Costin-Anton
    BRAIN-BROAD RESEARCH IN ARTIFICIAL INTELLIGENCE AND NEUROSCIENCE, 2023, 14 (02): : 93 - 114
  • [18] An Efficient Transformer-CNN Network for Document Image Binarization
    Zhang, Lina
    Wang, Kaiyuan
    Wan, Yi
    ELECTRONICS, 2024, 13 (12)
  • [19] Adaptive document image binarization
    Sauvola, J
    Pietikäinen, M
    PATTERN RECOGNITION, 2000, 33 (02) : 225 - 236
  • [20] Adaptive binarization of historical document images
    Kavallieratou, Ergina
    Stathis, Stamatatos
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS, 2006, : 742 - +