Insights on the Use of Convolutional Neural Networks for Document Image Binarization

Cited by: 0
Authors
Pastor-Pellicer, J. [1 ]
Espana-Boquera, S. [1 ]
Zamora-Martinez, F. [2 ]
Afzal, M. Zeshan [3 ]
Jose Castro-Bleda, Maria [1 ]
Affiliations
[1] Univ Politecn Valencia, Dept Sistemas Informat & Computac, E-46022 Valencia, Spain
[2] Univ CEU Cardenal Herrera, Dept Ciencias Fis Math & Computac, Valencia, Spain
[3] German Res Ctr Artificial Intelligence DFKI, Kaiserslautern, Germany
DOI
10.1007/978-3-319-19222-2_10
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Convolutional Neural Networks have consistently shown good performance in Computer Vision and in Handwritten Text Recognition tasks. This paper proposes the use of these models for document image binarization. The main idea is to classify each pixel of the image as foreground or background from a sliding window centered at the pixel to be classified. An experimental analysis of the effect of sensitive parameters is presented, and some working topologies are proposed, using two corpora with very different properties: DIBCO and Santgall.
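The sliding-window idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`extract_windows`, `binarize`, `mean_threshold_classifier`), the window half-size, and the edge padding are assumptions made for illustration, and the classifier is a simple placeholder standing in for the convolutional network, whose topology the abstract does not specify.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def extract_windows(image, half):
    """Return one (2*half+1) x (2*half+1) window per pixel, centered on it.

    Border pixels are handled by replicating edge values (an assumption;
    the abstract does not say how borders are treated).
    """
    padded = np.pad(image, half, mode="edge")
    size = 2 * half + 1
    # Resulting shape: (height, width, size, size)
    return sliding_window_view(padded, (size, size))


def mean_threshold_classifier(flat_windows):
    """Placeholder for the CNN (hypothetical stand-in, not a neural network).

    Labels a pixel as foreground (1) when the center pixel is darker than
    the mean of its window, and as background (0) otherwise.
    """
    center = flat_windows[:, flat_windows.shape[1] // 2]
    return (center < flat_windows.mean(axis=1)).astype(np.uint8)


def binarize(image, classify, half=2):
    """Classify every pixel from the window centered on it."""
    height, width = image.shape
    windows = extract_windows(image, half)
    flat = windows.reshape(height * width, -1)  # one row per pixel
    labels = classify(flat)                     # 1 = foreground, 0 = background
    return labels.reshape(height, width).astype(np.uint8)


# Toy grayscale page: bright background with a small dark "ink" square.
page = np.full((6, 6), 200, dtype=np.uint8)
page[2:4, 2:4] = 20
mask = binarize(page, mean_threshold_classifier, half=2)
```

In a setup closer to the one described, `classify` would be replaced by a trained network that receives each window and outputs a foreground or background decision; the window size is one of the parameters whose effect such an experiment would examine.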
Pages: 115 - 126
Page count: 12
Related papers
50 in total
  • [21] License Plate Character Recognition Using Binarization and Convolutional Neural Networks
    Angara, Sandeep
    Robinson, Melvin
    ADVANCES IN COMPUTER VISION, CVC, VOL 1, 2020, 943 : 272 - 283
  • [22] FD-Net: A Fully Dilated Convolutional Network for Historical Document Image Binarization
    Xiong, Wei
    Yue, Ling
    Zhou, Lei
    Wei, Liying
    Li, Min
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 518 - 529
  • [23] SOME INSIGHTS INTO CONVOLUTIONAL NEURAL NETWORKS
    Zhai, Jun-Hai
    Zang, Li-Guang
    Zhang, Su-Fang
    PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2017, : 123 - 129
  • [24] MSIO: MultiSpectral Document Image BinarizatIOn
    Diem, Markus
    Hollaus, Fabian
    Sablatnig, Robert
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 84 - 89
  • [25] A MULTISCALE OPERATOR FOR DOCUMENT IMAGE BINARIZATION
    Dorini, Leyza Baldo
    Leite, Neucimar Jeronimo
    VISAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2009, : 34 - 39
  • [26] Investigating coupling preprocessing with shallow and deep convolutional neural networks in document image classification
    Liu, Yi
    Soh, Leen-Kiat
    Lorang, Elizabeth
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (04)
  • [27] A Hybrid Approach for Document Image Binarization
    Sakila, A.
    Vijayarani, S.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTING AND INFORMATICS (ICICI 2017), 2017, : 645 - 650
  • [28] Historical Document Image Binarization: A Review
    Tensmeyer, C.
    Martinez, T.
    SN Computer Science, 2020, 1 (3)
  • [29] Adaptive degraded document image binarization
    Gatos, B
    Pratikakis, I
    Perantonis, SJ
    PATTERN RECOGNITION, 2006, 39 (03) : 317 - 327
  • [30] Augment Document Image Binarization by Learning
    Zhu, Yuanping
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 1905 - 1908