Combination of Two Fully Convolutional Neural Networks for Robust Binarization

被引:1
|
作者
Karpinski, Romain [1 ]
Belaid, Abdel [1 ]
机构
[1] Univ Lorraine, CNRS, LORIA, Campus Sci, F-54500 Vandoeuvre Les Nancy, France
来源
关键词
Historical documents; Binarization; Fully convolutional neural network;
D O I
10.1007/978-3-030-20893-6_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To be able to process historical documents, it is often required to first binarize the image (background and foreground separation) before applying the processing itself. Historical documents are challenging to binarize because of the numerous degradations they suffer such as bleed-through, illuminations, background degradations or ink drops. We present in this paper a new approach to tackle this task by a combination of two neural networks. Recently, the DIBCO binarization competition has seen a growing interest in the use of supervised methods to binarize challenging images. Inspired by the winner of the DIBCO 17 competition, which uses a fully convolutional neural network (FCN), we propose a combination of two FCNs to obtain better performance. While the two FCNs have the same architecture, they are trained on different representations of the input image. The first one uses downscaled image to capture the global context and the object locations. The second one works on patches of native resolution to help defining precisely the boundaries of the characters by capturing the local context. The final prediction is obtained by combining the results of the two FCNs. We show in the experiments that this strategy provides better results and outperforms the winner of the DIBCO17 competition.
引用
收藏
页码:509 / 524
页数:16
相关论文
共 50 条
  • [1] Document Image Binarization with Fully Convolutional Neural Networks
    Tensmeyer, Chris
    Martinez, Tony
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 99 - 104
  • [2] Heterogeneous Bitwidth Binarization in Convolutional Neural Networks
    Fromm, Josh
    Patel, Shwetak
    Philipose, Matthai
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [3] Insights on the Use of Convolutional Neural Networks for Document Image Binarization
    Pastor-Pellicer, J.
    Espana-Boquera, S.
    Zamora-Martinez, F.
    Afzal, M. Zeshan
    Jose Castro-Bleda, Maria
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, PT II, 2015, 9095 : 115 - 126
  • [4] Fully hyperbolic convolutional neural networks
    Keegan Lensink
    Bas Peters
    Eldad Haber
    Research in the Mathematical Sciences, 2022, 9
  • [5] Fully hyperbolic convolutional neural networks
    Lensink, Keegan
    Peters, Bas
    Haber, Eldad
    RESEARCH IN THE MATHEMATICAL SCIENCES, 2022, 9 (04)
  • [6] Fully shared convolutional neural networks
    Yao Lu
    Guangming Lu
    Jinxing Li
    Zheng Zhang
    Yuanrong Xu
    Neural Computing and Applications, 2021, 33 : 8635 - 8648
  • [7] Fully shared convolutional neural networks
    Lu, Yao
    Lu, Guangming
    Li, Jinxing
    Zhang, Zheng
    Xu, Yuanrong
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (14): : 8635 - 8648
  • [8] Robust Face Detector with Fully Convolutional Networks
    Su, Yingcheng
    Wan, Xiaopei
    Guo, Zhenhua
    PATTERN RECOGNITION AND COMPUTER VISION, PT III, 2018, 11258 : 207 - 218
  • [9] Robust 3D Face Alignment with Efficient Fully Convolutional Neural Networks
    Jiang, Lei
    Wu, Xiao-Jun
    Kittler, Josef
    IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 266 - 277
  • [10] Pixel-wise Binarization of Musical Documents with Convolutional Neural Networks
    Calvo-Zaragoza, Jorge
    Vigliensoni, Gabriel
    Fujinaga, Ichiro
    PROCEEDINGS OF THE FIFTEENTH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS - MVA2017, 2017, : 362 - 365