Document Image Binarization Using Recurrent Neural Networks

被引:28
|
作者
Westphal, Florian [1 ]
Lavesson, Niklas [1 ]
Grahn, Hakan [1 ]
机构
[1] Blekinge Inst Technol, Dept Comp Sci & Engn, Karlskrona, Sweden
关键词
image binarization; recurrent neural networks; Grid LSTM; historical documents;
D O I
10.1109/DAS.2018.71
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the context of document image analysis, image binarization is an important preprocessing step for other document analysis algorithms, but also relevant on its own by improving the readability of images of historical documents. While historical document image binarization is challenging due to common image degradations, such as bleedthrough, faded ink or stains, achieving good binarization performance in a timely manner is a worthwhile goal to facilitate efficient information extraction from historical documents. In this paper, we propose a recurrent neural network based algorithm using Grid Long Short-Term Memory cells for image binarization, as well as a pseudo F-Measure based weighted loss function. We evaluate the binarization and execution performance of our algorithm for different choices of footprint size, scale factor and loss function. Our experiments show a significant trade-off between binarization time and quality for different footprint sizes. However, we see no statistically significant difference when using different scale factors and only limited differences for different loss functions. Lastly, we compare the binarization performance of our approach with the best performing algorithm in the 2016 handwritten document image binarization contest and show that both algorithms perform equally well.
引用
收藏
页码:263 / 268
页数:6
相关论文
共 50 条
  • [31] Improved binarization algorithm for document image
    Chen, Dan
    Zhang, Feng
    He, Guiming
    [J]. Jisuanji Gongcheng/Computer Engineering, 2003, 29 (13):
  • [32] Document Image Binarization Based on NFCM
    Tong Li-Jing
    Chen Kan
    Zhang Yan
    Fu Xiao-Ling
    Duan Jian-Yong
    [J]. PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 1769 - 1773
  • [33] A Survey on Document Image Binarization Techniques
    Lokhande, Supriya Sunil
    Dawande, N. A.
    [J]. 1ST INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION ICCUBEA 2015, 2015, : 742 - 746
  • [34] Document image binarization by using texture-edge descriptor
    Armanfard, N.
    Valizadeh, M.
    Komeili, M.
    Kabir, E.
    [J]. 2009 14TH INTERNATIONAL COMPUTER CONFERENCE, 2009, : 133 - 138
  • [35] Parameter tuning for document image binarization using a racing algorithm
    Mesquita, Rafael G.
    Silva, Ricardo M. A.
    Mello, Carlos A. B.
    Miranda, Pericles B. C.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (05) : 2593 - 2603
  • [36] A Hybrid Approach for Document Image Binarization
    Sakila, A.
    Vijayarani, S.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTING AND INFORMATICS (ICICI 2017), 2017, : 645 - 650
  • [37] Fast binarization algorithm for document image
    Shanghai Jiaotong Univ, Shanghai, China
    [J]. Hongwai Yu Haomibo Xuebao, 5 (344-350):
  • [38] A Novel Approach for Document Image Binarization
    Vishnupriya, S.
    Saranya, P.
    Elangovan, E.
    [J]. ICACCS 2015 PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS, 2015,
  • [39] Historical Document Image Binarization: A Review
    Tensmeyer C.
    Martinez T.
    [J]. SN Computer Science, 2020, 1 (3)
  • [40] Binarization of Degraded Document Images Using Convolutional Neural Networks and Wavelet-Based Multichannel Images
    Akbari, Younes
    Al-Maadeed, Somaya
    Adam, Kalthoum
    [J]. IEEE ACCESS, 2020, 8 (08): : 153517 - 153534