Document Image Binarization Using Recurrent Neural Networks

被引:28
|
作者
Westphal, Florian [1 ]
Lavesson, Niklas [1 ]
Grahn, Hakan [1 ]
机构
[1] Blekinge Inst Technol, Dept Comp Sci & Engn, Karlskrona, Sweden
关键词
image binarization; recurrent neural networks; Grid LSTM; historical documents;
D O I
10.1109/DAS.2018.71
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the context of document image analysis, image binarization is an important preprocessing step for other document analysis algorithms, but also relevant on its own by improving the readability of images of historical documents. While historical document image binarization is challenging due to common image degradations, such as bleedthrough, faded ink or stains, achieving good binarization performance in a timely manner is a worthwhile goal to facilitate efficient information extraction from historical documents. In this paper, we propose a recurrent neural network based algorithm using Grid Long Short-Term Memory cells for image binarization, as well as a pseudo F-Measure based weighted loss function. We evaluate the binarization and execution performance of our algorithm for different choices of footprint size, scale factor and loss function. Our experiments show a significant trade-off between binarization time and quality for different footprint sizes. However, we see no statistically significant difference when using different scale factors and only limited differences for different loss functions. Lastly, we compare the binarization performance of our approach with the best performing algorithm in the 2016 handwritten document image binarization contest and show that both algorithms perform equally well.
引用
收藏
页码:263 / 268
页数:6
相关论文
共 50 条
  • [41] Document image binarization using local features and Gaussian mixture modeling
    Mitianoudis, Nikolaos
    Papamarkos, Nikolaos
    [J]. IMAGE AND VISION COMPUTING, 2015, 38 : 33 - 51
  • [42] Document Image Binarization Using "Multi-Scale" Predefined Filters
    Saabni, Raid M.
    [J]. NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [43] Efficient document image binarization using heterogeneous computing and parameter tuning
    Florian Westphal
    Håkan Grahn
    Niklas Lavesson
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2018, 21 : 41 - 58
  • [44] Document Image Binarization Using Visibility Detection and Point Cloud Segmentation
    Li, Jianhong
    Chen, Yan
    Liu, Siling
    [J]. PATTERN RECOGNITION AND COMPUTER VISION,, PT III, 2021, 13021 : 92 - 104
  • [45] Efficient document image binarization using heterogeneous computing and parameter tuning
    Westphal, Florian
    Grahn, Hakan
    Lavesson, Niklas
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2018, 21 (1-2) : 41 - 58
  • [46] Historical document image binarization using background estimation and energy minimization
    Xiong, Wei
    Jia, Xiuhong
    Xu, Jingjing
    Xiong, Zijie
    Liu, Min
    Wang, Juan
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3716 - 3721
  • [47] Radar image segmentation using recurrent artificial neural networks
    Ziemke, T
    [J]. PATTERN RECOGNITION LETTERS, 1996, 17 (04) : 319 - 334
  • [48] Situation recognition using image moments and recurrent neural networks
    Khan, Yaser Daanial
    Ahmed, Farooq
    Khan, Sher Afzal
    [J]. NEURAL COMPUTING & APPLICATIONS, 2014, 24 (7-8): : 1519 - 1529
  • [49] Situation recognition using image moments and recurrent neural networks
    Yaser Daanial Khan
    Farooq Ahmed
    Sher Afzal Khan
    [J]. Neural Computing and Applications, 2014, 24 : 1519 - 1529
  • [50] DOCUMENT IMAGE BINARIZATION PRESERVING STROKE CONNECTIVITY
    OH, IS
    [J]. PATTERN RECOGNITION LETTERS, 1995, 16 (07) : 743 - 748