Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Networks

被引:0
|
作者
Junho Jo
Hyung Il Koo
Jae Woong Soh
Nam Ik Cho
机构
[1] Seoul National University,Dept. of Electrical and Computer Engineering
[2] Seoul National University,Dept. of Electrical and Computer Eng., INMC
[3] Ajou University,Dept. of Electrical and Computer Engineering
来源
关键词
Handwritten text segmentation; Text separation; Data synthesis; Class imbalance problem; Optical character recognition;
D O I
暂无
中图分类号
学科分类号
摘要
We present a method that separates handwritten and machine-printed components that are mixed and overlapped in documents. Many conventional methods addressed this problem by extracting connected components (CCs) and classifying the extracted CCs into two classes. They were based on the assumption that two types of components are not overlapping each other, while we are focusing on more challenging and realistic cases where the components are often overlapping each other. For this, we propose a new method that performs pixel-level classification with a convolutional neural network. Unlike conventional neural network methods, our method works in an end-to-end manner and does not require any preprocessing steps (e.g., foreground extraction, handcrafted feature extraction, and so on). For the training of our network, we develop a cross-entropy based loss function to alleviate the class imbalance problem. Regarding the training dataset, although there are some datasets of mixed printed characters and handwritten scripts, most of them do not have overlapping cases and do not provide pixel-level annotations. Hence, we also propose a data synthesis method that generates realistic pixel-level training samples having many overlappings of printed and handwritten components. Experimental results on synthetic and real images have shown the effectiveness of the proposed method. Although the proposed network has been trained only with synthetic images, it also improves the OCR rate of real documents. Specifically, the OCR rate for machine-printed texts is increased from 0.8087 to 0.9442 by removing the overlapped handwritten scribbles by our method.
引用
收藏
页码:32137 / 32150
页数:13
相关论文
共 50 条
  • [1] Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Networks
    Jo, Junho
    Koo, Hyung Il
    Soh, Jae Woong
    Cho, Nam Ik
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (43-44) : 32137 - 32150
  • [2] Leukocyte Segmentation via End-to-End Learning of Deep Convolutional Neural Networks
    Lu, Yan
    Fan, Haoyi
    Li, Zuoyong
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: VISUAL DATA ENGINEERING, PT I, 2019, 11935 : 191 - 200
  • [3] End-to-End Text Recognition with Convolutional Neural Networks
    Wang, Tao
    Wu, David J.
    Coates, Adam
    Ng, Andrew Y.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3304 - 3308
  • [4] Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks
    Li, Hui
    Wang, Peng
    Shen, Chunhua
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5248 - 5256
  • [5] End-to-end face parsing via interlinked convolutional neural networks
    Zi Yin
    Valentin Yiu
    Xiaolin Hu
    Liang Tang
    Cognitive Neurodynamics, 2021, 15 : 169 - 179
  • [6] End-to-end face parsing via interlinked convolutional neural networks
    Yin, Zi
    Yiu, Valentin
    Hu, Xiaolin
    Tang, Liang
    COGNITIVE NEURODYNAMICS, 2021, 15 (01) : 169 - 179
  • [7] Convolutional Dictionary Learning by End-To-End Training of Iterative Neural Networks
    Kofler, Andreas
    Wald, Christian
    Schaeffter, Tobias
    Haltmeier, Markus
    Kolbitsch, Christoph
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1213 - 1217
  • [8] Firearm Detection via Convolutional Neural Networks: Comparing a Semantic Segmentation Model Against End-to-End Solutions
    Egiazarov, Alexander
    Zennaro, Fabio Massimo
    Mavroeidis, Vasileios
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 1796 - 1804
  • [9] CONVOLUTIONAL ANALYSIS OPERATOR LEARNING BY END-TO-END TRAINING OF ITERATIVE NEURAL NETWORKS
    Kofler, Andreas
    Wald, Christian
    Schaeffter, Tobias
    Haltmeier, Markus
    Kolbitsch, Christoph
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [10] End-to-end learning of convolutional neural net and dynamic programming for left ventricle segmentation
    Nguyen, Nhat M.
    Ray, Nilanjan
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 121, 2020, 121 : 555 - 569