Document Image Binarization Using Recurrent Neural Networks

Cited by: 28
Authors
Westphal, Florian [1]
Lavesson, Niklas [1]
Grahn, Hakan [1]
Affiliations
[1] Blekinge Inst Technol, Dept Comp Sci & Engn, Karlskrona, Sweden
Keywords
image binarization; recurrent neural networks; Grid LSTM; historical documents
DOI
10.1109/DAS.2018.71
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In the context of document image analysis, image binarization is an important preprocessing step for other document analysis algorithms, but also relevant on its own by improving the readability of images of historical documents. While historical document image binarization is challenging due to common image degradations, such as bleedthrough, faded ink or stains, achieving good binarization performance in a timely manner is a worthwhile goal to facilitate efficient information extraction from historical documents. In this paper, we propose a recurrent neural network based algorithm using Grid Long Short-Term Memory cells for image binarization, as well as a pseudo F-Measure based weighted loss function. We evaluate the binarization and execution performance of our algorithm for different choices of footprint size, scale factor and loss function. Our experiments show a significant trade-off between binarization time and quality for different footprint sizes. However, we see no statistically significant difference when using different scale factors and only limited differences for different loss functions. Lastly, we compare the binarization performance of our approach with the best performing algorithm in the 2016 handwritten document image binarization contest and show that both algorithms perform equally well.
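The abstract mentions a pseudo F-Measure based weighted loss but does not define it here. As a rough illustration of the underlying idea only, the sketch below computes an F-measure for a binarized image in which each pixel can carry its own weight in the recall and precision terms. The function name `weighted_f_measure` and the weight maps `recall_w` and `precision_w` are hypothetical; how the paper derives its weights and turns the measure into a training loss is not reproduced.

```python
import numpy as np


def weighted_f_measure(pred, truth, recall_w=None, precision_w=None):
    """F-measure with optional per-pixel weights (illustrative sketch).

    pred, truth: arrays of the same shape, nonzero/True = foreground (ink).
    recall_w, precision_w: hypothetical per-pixel weight maps; when omitted,
    all pixels weigh 1 and the result is the ordinary F-measure.
    """
    pred = np.asarray(pred, dtype=bool)
    truth = np.asarray(truth, dtype=bool)
    if recall_w is None:
        recall_w = np.ones(truth.shape)
    if precision_w is None:
        precision_w = np.ones(truth.shape)

    hit = pred & truth  # correctly detected foreground pixels

    # Weighted recall: share of weighted ground-truth foreground recovered.
    recall_den = (recall_w * truth).sum()
    recall = (recall_w * hit).sum() / recall_den if recall_den > 0 else 0.0

    # Weighted precision: share of weighted predicted foreground that is correct.
    precision_den = (precision_w * pred).sum()
    precision = (precision_w * hit).sum() / precision_den if precision_den > 0 else 0.0

    if recall + precision == 0:
        return 0.0
    return 2 * recall * precision / (recall + precision)


if __name__ == "__main__":
    prediction = np.array([[1, 1], [0, 0]])
    ground_truth = np.array([[1, 0], [1, 0]])
    print(weighted_f_measure(prediction, ground_truth))
```

Raising the recall weight of a missed foreground pixel lowers the score more than missing a lightly weighted one, which is the general effect a weighted measure of this kind is meant to capture.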
Pages: 263-268
Number of pages: 6