Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Networks

Authors
Junho Jo
Hyung Il Koo
Jae Woong Soh
Nam Ik Cho
Affiliations
[1] Seoul National University, Dept. of Electrical and Computer Engineering
[2] Seoul National University, Dept. of Electrical and Computer Eng., INMC
[3] Ajou University, Dept. of Electrical and Computer Engineering
Keywords
Handwritten text segmentation; Text separation; Data synthesis; Class imbalance problem; Optical character recognition
Abstract
We present a method for separating handwritten and machine-printed components that are mixed and overlapped in documents. Many conventional methods address this problem by extracting connected components (CCs) and classifying each extracted CC into one of two classes. These methods assume that the two types of components do not overlap, whereas we focus on the more challenging and realistic case in which they often do. To this end, we propose a method that performs pixel-level classification with a convolutional neural network. Unlike conventional neural network methods, our method works in an end-to-end manner and requires no preprocessing steps (e.g., foreground extraction or handcrafted feature extraction). To train the network, we develop a cross-entropy-based loss function that alleviates the class imbalance problem. As for training data, although some datasets of mixed printed characters and handwritten scripts exist, most of them contain no overlapping cases and provide no pixel-level annotations. Hence, we also propose a data synthesis method that generates realistic pixel-level training samples with many overlaps between printed and handwritten components. Experimental results on synthetic and real images show the effectiveness of the proposed method. Although the proposed network is trained only on synthetic images, it also improves the OCR rate on real documents. Specifically, the OCR rate for machine-printed text increases from 0.8087 to 0.9442 when our method removes the overlapping handwritten scribbles.
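The abstract mentions a cross-entropy-based loss for class imbalance but does not give its form. As an illustration only, the following is a minimal sketch of one common variant, a pixel-wise cross-entropy weighted by inverse class frequency. It is not the loss proposed in the paper. The function names, the three-class labeling (0 = background, 1 = machine-printed, 2 = handwritten), and the weighting scheme are all assumptions made for this example.

```python
import numpy as np

def inverse_frequency_weights(labels, num_classes, eps=1e-6):
    """Hypothetical helper: per-class weights proportional to 1 / pixel
    frequency, normalized so that the weights average to 1."""
    counts = np.bincount(labels.ravel(), minlength=num_classes).astype(np.float64)
    freq = counts / counts.sum()
    weights = 1.0 / (freq + eps)
    return weights / weights.mean()

def weighted_pixel_cross_entropy(logits, labels, class_weights):
    """Weighted mean cross-entropy over all pixels (an assumed variant,
    not the paper's loss).

    logits: float array of shape (H, W, C), raw network outputs
    labels: int array of shape (H, W), class index per pixel
    class_weights: float array of shape (C,)
    """
    # Numerically stable log-softmax over the class axis.
    shifted = logits - logits.max(axis=-1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))
    # Log-probability assigned to the true class of each pixel.
    picked = np.take_along_axis(log_probs, labels[..., None], axis=-1)[..., 0]
    # Rare classes get larger weights, so they contribute more per pixel.
    w = class_weights[labels]
    return float(-(w * picked).sum() / w.sum())

# Toy label map: mostly background, few printed/handwritten pixels (assumed classes).
labels = np.array([[0, 0, 0, 0],
                   [0, 0, 1, 2]])
weights = inverse_frequency_weights(labels, num_classes=3)
logits = np.zeros((2, 4, 3))  # uninformative predictions
loss = weighted_pixel_cross_entropy(logits, labels, weights)
```

In this toy case the background class dominates the pixel count, so it receives the smallest weight, which is the general idea behind reweighting a cross-entropy loss for imbalanced segmentation.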
Pages: 32137-32150
Page count: 13