DP-LinkNet: A convolutional network for historical document image binarization

Cited by: 42
Authors
Xiong, Wei [1, 2]
Jia, Xiuhong [1]
Yang, Dichun [1]
Ai, Meihui [1]
Li, Lirong [1]
Wang, Song [2]
Affiliations
[1] Hubei Univ Technol, Sch Elect & Elect Engn, Wuhan 430068, Hubei, Peoples R China
[2] Univ South Carolina, Dept Comp Sci & Engn, Columbia, SC 29201 USA
Funding
National Natural Science Foundation of China
Keywords
Degraded document image binarization; semantic segmentation; DP-LinkNet; encoder-decoder architecture; hybrid dilated convolution (HDC); spatial pyramid pooling (SPP); COMPETITION
DOI
10.3837/tiis.2021.05.011
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Document image binarization is an important pre-processing step in document analysis and archiving. The state-of-the-art models for document image binarization are variants of encoder-decoder architectures, such as FCN (fully convolutional network) and U-Net. Despite their success, they still suffer from three limitations: (1) reduced feature map resolution due to consecutive strided pooling or convolutions, (2) target objects that appear at multiple scales, and (3) reduced localization accuracy due to the built-in invariance of deep convolutional neural networks (DCNNs). To overcome these three limitations, we propose an improved semantic segmentation model, referred to as DP-LinkNet, which adopts the D-LinkNet architecture as its backbone and places the proposed hybrid dilated convolution (HDC) and spatial pyramid pooling (SPP) modules between the encoder and the decoder. Extensive experiments are conducted on the recent Document Image Binarization Competition (DIBCO) and Handwritten Document Image Binarization Competition (H-DIBCO) benchmark datasets. Results show that the proposed DP-LinkNet outperforms other state-of-the-art techniques by a large margin. Our implementation and the pre-trained models are available at https://github.com/beargolden/DP-LinkNet.
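The trade-off behind limitation (1) and the motivation for dilated convolution can be illustrated with a little arithmetic. The sketch below is not taken from the DP-LinkNet implementation; the function names, the 3x3 kernel size, the dilation rates, and the number of stride-2 layers are all hypothetical values chosen for illustration. It only shows that stacking stride-1 dilated convolutions enlarges the receptive field without shrinking the feature map, whereas each stride-2 layer halves the resolution.

```python
def receptive_field(kernel_size, dilations):
    """Receptive field (pixels along one axis) of stride-1 convolutions
    stacked in series, one layer per entry in `dilations`."""
    rf = 1
    for d in dilations:
        # Each layer widens the field by (kernel_size - 1) * dilation.
        rf += (kernel_size - 1) * d
    return rf


def size_after_downsampling(input_size, num_stride2_layers):
    """Feature map size after a number of stride-2 layers,
    assuming 'same'-style padding (output = ceil(input / 2))."""
    size = input_size
    for _ in range(num_stride2_layers):
        size = (size + 1) // 2
    return size


# Hypothetical settings, for illustration only.
plain = receptive_field(3, [1, 1, 1, 1])    # four ordinary 3x3 convolutions
dilated = receptive_field(3, [1, 2, 4, 8])  # four dilated 3x3 convolutions
shrunk = size_after_downsampling(256, 5)    # five stride-2 stages

print(plain, dilated, shrunk)
```

Under these assumed settings, the dilated stack covers a much wider context than the plain stack at the same depth and the same output resolution, which is the general property that dilated modules placed between an encoder and a decoder rely on.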
Pages: 1778-1797
Page count: 20