Experimental Application of a Japanese Historical Document Image Synthesis Method to Text Line Segmentation

被引:1
|
作者
Inuzuka, Naoto [1 ]
Suzuki, Tetsuya [1 ]
机构
[1] Shibaura Inst Technol, Grad Sch Syst Engn & Sci, Saitama, Japan
来源
PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM) | 2021年
关键词
Text Line Segmentation; Historical Document; Deep Learning; Data Synthesis;
D O I
10.5220/0010330206280634
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We plan to use a text line segmentation method based on machine learning in our transcription support system for handwritten Japanese historical document in Kana, and are searching for a data synthesis method of annotated document images because it is time consuming to manually annotate a large set of document images for training data for machine learning. In this paper, we report our synthesis method of annotated document images designed for a Japanese historical document. To compare manually annotated Japanese historical document images and annotated document images synthesized by the method as training data for an object detection algorithm YOLOv3, we conducted text line segmentation experiments using the object detection algorithm. The experimental results show that a model trained by the synthetic annotated document images are competitive with that trained by the manually annotated document images from the view point of a metric intersection-over-union.
引用
收藏
页码:628 / 634
页数:7
相关论文
共 50 条
  • [31] A Hybrid Method for Historical Degraded Document Image
    NaouelOuafek
    Mohamed-KhireddineKholladi
    2018 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS (CIIS 2018), 2018, : 66 - 70
  • [32] TEXT LINE DETECTION IN MULTICOLUMN FOR INDIAN SCRIPTS USING HISTOGRAM: A DOCUMENT IMAGE ANALYSIS APPLICATION
    Kumar, Umesh
    Raheja, Jagdish
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2009), VOLS 1 AND 2, 2009, : 161 - 168
  • [33] A Robust Hybrid Approach for Text Line Segmentation in Historical Documents
    Clausner, Christian
    Antonacopoulos, Apostolos
    Pletschacher, Stefan
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 335 - 338
  • [34] Reducing the Human Effort in Text Line Segmentation for Historical Documents
    Granell, Emilio
    Quiros, Lorenzo
    Romero, Veronica
    Andreu Sanchez, Joan
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT III, 2021, 12823 : 523 - 537
  • [35] A Novel Text Line Segmentation Method Based on Contour Curve Tracking for Tibetan Historical Documents
    Zhou, Fengming
    Wang, Weilan
    Lin, Qiang
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (10)
  • [36] A Multilevel Text line Segmentation Framework for Handwritten Historical Documents
    Ben Messaoud, Ines
    Amiri, Hamid
    El Abed, Haikal
    Maergner, Volker
    13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 515 - 520
  • [37] Text Line Segmentation in Handwritten Document Images Using Tensor Voting
    Toan Dinh Nguyen
    Gueesang Lee
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2011, E94A (11) : 2434 - 2441
  • [38] An End-to-End Framework for Evaluating Explainable Deep Models: Application to Historical Document Image Segmentation
    Brini, Iheb
    Mehri, Maroua
    Ingold, Rolf
    Ben Amara, Najoua Essoukri
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2022, 2022, 13501 : 106 - 119
  • [39] A Text Line Extraction Method for Archival Document Transcription
    Mechi, Olfa
    Mehri, Maroua
    Ingold, Rolf
    Ben Amara, Najoua Essoukri
    PROCEEDINGS OF THE 2020 17TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD 2020), 2020, : 479 - 484
  • [40] An Efficient Line Segmentation Approach for Handwritten Bangla Document Image
    Mullick, K.
    Banerjee, S.
    Bhattacharya, U.
    2015 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2015, : 130 - +