Combination of Two Fully Convolutional Neural Networks for Robust Binarization

被引:1
|
作者
Karpinski, Romain [1 ]
Belaid, Abdel [1 ]
机构
[1] Univ Lorraine, CNRS, LORIA, Campus Sci, F-54500 Vandoeuvre Les Nancy, France
来源
关键词
Historical documents; Binarization; Fully convolutional neural network;
D O I
10.1007/978-3-030-20893-6_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To be able to process historical documents, it is often required to first binarize the image (background and foreground separation) before applying the processing itself. Historical documents are challenging to binarize because of the numerous degradations they suffer such as bleed-through, illuminations, background degradations or ink drops. We present in this paper a new approach to tackle this task by a combination of two neural networks. Recently, the DIBCO binarization competition has seen a growing interest in the use of supervised methods to binarize challenging images. Inspired by the winner of the DIBCO 17 competition, which uses a fully convolutional neural network (FCN), we propose a combination of two FCNs to obtain better performance. While the two FCNs have the same architecture, they are trained on different representations of the input image. The first one uses downscaled image to capture the global context and the object locations. The second one works on patches of native resolution to help defining precisely the boundaries of the characters by capturing the local context. The final prediction is obtained by combining the results of the two FCNs. We show in the experiments that this strategy provides better results and outperforms the winner of the DIBCO17 competition.
引用
收藏
页码:509 / 524
页数:16
相关论文
共 50 条
  • [21] Crowd abnormal detection using two-stream Fully Convolutional Neural Networks
    Wei, Hongtao
    Xiao, Yao
    Li, Ruifang
    Liu, Xinhua
    2018 10TH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA), 2018, : 332 - 336
  • [22] XnODR and XnIDR: Two Accurate and Fast Fully Connected Layers for Convolutional Neural Networks
    Jian Sun
    Ali Pourramezan Fard
    Mohammad H. Mahoor
    Journal of Intelligent & Robotic Systems, 2023, 109
  • [23] Convolutional Neural Networks for Robust Classification of Drones
    Dale, Holly
    Jahangir, Mohammed
    Baker, Christopher J.
    Antoniou, Michail
    Harman, Stephen
    Ahmad, Bashar, I
    2022 IEEE RADAR CONFERENCE (RADARCONF'22), 2022,
  • [24] Towards Robust Compressed Convolutional Neural Networks
    Wijayanto, Arie Wahyu
    Choong, Jun Jin
    Madhawa, Kaushalya
    Murata, Tsuyoshi
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2019, : 168 - 175
  • [25] Robust Convolutional Neural Networks for Image Recognition
    Albeahdili, Hayder M.
    Alwzwazy, Haider A.
    Islam, Naz E.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2015, 6 (11) : 105 - 111
  • [26] Microaneurysm detection using fully convolutional neural networks
    Chudzik, Piotr
    Majumdar, Somshubra
    Caliva, Francesco
    Al-Diri, Bashir
    Hunter, Andrew
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2018, 158 : 185 - 192
  • [27] Optimizing Fully Spectral Convolutional Neural Networks on FPGA
    Liu, Shuanglong
    Luk, Wayne
    2020 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2020), 2020, : 39 - 47
  • [28] Fully Convolutional Neural Networks for Polyp Segmentation in Colonoscopy
    Brandao, Patrick
    Mazomenos, Evangelos
    Ciuti, Gastone
    Calio, Renato
    Bianchi, Federico
    Menciassi, Arianna
    Dario, Paolo
    Koulaouzidis, Anastasios
    Arezzo, Alberto
    Stoyanov, Danail
    MEDICAL IMAGING 2017: COMPUTER-AIDED DIAGNOSIS, 2017, 10134
  • [29] Classification of titanium microstructure with fully convolutional neural networks
    Mongkhonthanaphon, S.
    Limpiyakorn, Y.
    2018 11TH INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING, 2019, 1195
  • [30] Fully Convolutional Neural Networks for Newspaper Article Segmentation
    Meier, Benjamin
    Stadelmann, Thilo
    Stampfli, Jan
    Arnold, Marek
    Cieliebak, Mark
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 414 - 419