Experimental Application of a Japanese Historical Document Image Synthesis Method to Text Line Segmentation

被引:1
|
作者
Inuzuka, Naoto [1 ]
Suzuki, Tetsuya [1 ]
机构
[1] Shibaura Inst Technol, Grad Sch Syst Engn & Sci, Saitama, Japan
来源
PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM) | 2021年
关键词
Text Line Segmentation; Historical Document; Deep Learning; Data Synthesis;
D O I
10.5220/0010330206280634
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We plan to use a text line segmentation method based on machine learning in our transcription support system for handwritten Japanese historical document in Kana, and are searching for a data synthesis method of annotated document images because it is time consuming to manually annotate a large set of document images for training data for machine learning. In this paper, we report our synthesis method of annotated document images designed for a Japanese historical document. To compare manually annotated Japanese historical document images and annotated document images synthesized by the method as training data for an object detection algorithm YOLOv3, we conducted text line segmentation experiments using the object detection algorithm. The experimental results show that a model trained by the synthetic annotated document images are competitive with that trained by the manually annotated document images from the view point of a metric intersection-over-union.
引用
收藏
页码:628 / 634
页数:7
相关论文
共 50 条
  • [41] An Active Contour Based Method for Image Binarization: Application to degraded historical document images
    Hadjadj, Zineb
    Meziane, Abdelkrim
    Cheriet, Mohamed
    Cherfa, Yazid
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 655 - 660
  • [42] An Active Contour Based Method for Image Binarization: Application to degraded historical document images
    Hadjadj, Zineb
    Meziane, Abdelkrim
    Cheriet, Mohamed
    Cherfa, Yazid
    2014 4TH INTERNATIONAL SYMPOSIUM ISKO-MAGHREB: CONCEPTS AND TOOLS FOR KNOWLEDGE MANAGEMENT (ISKO-MAGHREB), 2014,
  • [43] Segmentation of on-line handwritten Japanese text using SVM for improving text recognition
    Zhu, BL
    Tokuno, J
    Nakagawa, M
    DOCUMENT ANALYSIS SYSTEMS VII, PROCEEDINGS, 2006, 3872 : 208 - 219
  • [44] Historical document image segmentation using background light intensity normalization
    Shi, ZX
    Govindaraju, V
    DOCUMENT RECOGNITION AND RETRIEVAL XII, 2005, 5676 : 167 - 174
  • [45] An Efficient Cooperative Smearing Technique for Degraded Historical Document Image Segmentation
    Boudraa, Omar
    Hidouci, Walid Khaled
    Michelucci, Dominique
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2021, 21 (02)
  • [46] A method for combining complementary techniques for document image segmentation
    Stamatopoulos, Nikolaos
    Gatos, Basilis
    Perantonis, Stavros J.
    PATTERN RECOGNITION, 2009, 42 (12) : 3158 - 3168
  • [47] Segmentation of on-line handwritten Japanese text of arbitrary line direction by a neural network for improving text recognition
    Zhu, BL
    Nakagawa, M
    EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 157 - 161
  • [48] Learning-Free Text Line Segmentation for Historical Handwritten Documents
    Barakat, Berat Kurar
    Cohen, Rafi
    Droby, Ahmad
    Rabaev, Irina
    El-Sana, Jihad
    APPLIED SCIENCES-BASEL, 2020, 10 (22): : 1 - 19
  • [49] Frame Detection and Text Line Segmentation for Early Japanese Books Understanding
    Bing, Lyu
    Tomiyama, Hiroyuki
    Meng, Lin
    ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 600 - 606
  • [50] Fringe map based text line segmentation of printed Telugu document images
    Department of CSE, CMR College of Engineering and Technology, Hyderabad 501401, India
    不详
    Proc. Int. Conf. Doc. Anal. Recognit., (1294-1298):