Experimental Application of a Japanese Historical Document Image Synthesis Method to Text Line Segmentation

被引:1
|
作者
Inuzuka, Naoto [1 ]
Suzuki, Tetsuya [1 ]
机构
[1] Shibaura Inst Technol, Grad Sch Syst Engn & Sci, Saitama, Japan
来源
PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM) | 2021年
关键词
Text Line Segmentation; Historical Document; Deep Learning; Data Synthesis;
D O I
10.5220/0010330206280634
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We plan to use a text line segmentation method based on machine learning in our transcription support system for handwritten Japanese historical document in Kana, and are searching for a data synthesis method of annotated document images because it is time consuming to manually annotate a large set of document images for training data for machine learning. In this paper, we report our synthesis method of annotated document images designed for a Japanese historical document. To compare manually annotated Japanese historical document images and annotated document images synthesized by the method as training data for an object detection algorithm YOLOv3, we conducted text line segmentation experiments using the object detection algorithm. The experimental results show that a model trained by the synthetic annotated document images are competitive with that trained by the manually annotated document images from the view point of a metric intersection-over-union.
引用
收藏
页码:628 / 634
页数:7
相关论文
共 50 条
  • [21] Text Line Segmentation in Images of Handwritten Historical Documents
    Sanchez, A.
    Suarez, P. D.
    Melloz, C. A. B.
    Oliveira, A. L. I.
    Alves, V. M. O.
    2008 FIRST INTERNATIONAL WORKSHOPS ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2008, : 232 - +
  • [22] New method of cloud synthesis and application in image segmentation
    Xu, Kai
    Qin, Kun
    Li, Deren
    MIPPR 2007: AUTOMATIC TARGET RECOGNITION AND IMAGE ANALYSIS; AND MULTISPECTRAL IMAGE ACQUISITION, PTS 1 AND 2, 2007, 6786
  • [23] A Text-Line Segmentation Method for Historical Tibetan Documents Based on Baseline Detection
    Li, Yanxing
    Ma, Longlong
    Duan, Lijuan
    Wu, Jian
    COMPUTER VISION, PT I, 2017, 771 : 356 - 367
  • [24] DENSE PREDICTION FOR TEXT LINE SEGMENTATION IN HANDWRITTEN DOCUMENT IMAGES
    Quang Nhat Vo
    Lee, GueeSang
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 3264 - 3268
  • [25] Text line segmentation in handwritten document using a production system
    Nicolas, S
    Paquet, T
    Heutte, L
    NINTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION, PROCEEDINGS, 2004, : 245 - 250
  • [26] A Novel Method for Text and Non-Text Segmentation in Document Images
    Deivalakshmi, S.
    Palanisamy, P.
    Vishwanathan, Gayatri
    2013 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2013, : 255 - 259
  • [27] Automated Text line Segmentation and Table detection for Pre-Printed Document Image Analysis Systems
    Rani, N. Shobha
    Pruthvi, T. R.
    Rao, Aishwarya Govinda
    Bipin, Nair B. J.
    ICSPC'21: 2021 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICPSC), 2021, : 723 - 730
  • [28] Robust Document Image Dewarping Method using Text-lines and Line Segments
    Kil, Taeho
    Seo, Wonkyo
    Koo, Hyung Il
    Cho, Nam Ik
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 865 - 870
  • [29] Eigenspace method for text retrieval in historical document images
    Terasawa, K
    Nagasaki, T
    Kawashima, T
    EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 437 - 441
  • [30] A Color Based Image Segmentation and its Application to Text Segmentation
    Roy, Anandarup
    Parui, Swapan Kumar
    Paul, Amitav
    Roy, Utpal
    SIXTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS & IMAGE PROCESSING ICVGIP 2008, 2008, : 313 - +