Japanese historical character recognition by focusing on character parts

被引:0
|
作者
Ishikawa, Takuru [1 ]
Miyazaki, Tomo [1 ]
Omachi, Shinichiro [1 ]
机构
[1] Tohoku Univ, Grad Sch Engn, 6-6-05 Aoba Aramakiaza, Sendai, Miyagi 9808579, Japan
关键词
Historical document analysis; Japanese historical character; Learning character parts; Few-shot; Zero-shot recognition;
D O I
10.1016/j.patcog.2023.110181
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Japanese historical documents provide valuable information. Character recognition is a critical technology for the digitalization of historical documents. Sample imbalance is a significant obstacle in recognizing Japanese historical characters, kuzushiji. Thousands of kuzushiji only have less than a few samples. Thus, recognition performance deteriorates greatly in kuzushiji with a few samples. In this study, we propose a framework for transferring knowledge of character parts from font to kuzushiji. The pretraining learns character parts from synthesized font images. However, fine-tuning to kuzushiji is more complex. We propose calculating a mean squared error loss between feature vectors of kuzushiji and font images, resulting in consistent feature vectors in kuzushiji and font. Consequently, we can perform zero-shot recognition for kuzushiji using the font images of zero-sampled kuzushiji. The experimental results show that the proposed method recognized zero-sampled kuzushiji at approximately 48% accuracy. Consequently, we significantly expand the number of recognizable kuzushiji.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Japanese historical character recognition using convolutional neural networks
    Advanced Engineering Faculty, National Institute of Technology, Matsue College, 14-4 Nishi-ikuma, Matsue
    Shimane
    690-8518, Japan
    不详
    Shimane
    690-8518, Japan
    不详
    Shimane
    690-8518, Japan
    ICIC Express Lett Part B Appl., 12 (3159-3164):
  • [2] A STUDY ON JAPANESE HISTORICAL CHARACTER RECOGNITION USING MODULAR NEURAL NETWORKS
    Horiuchi, Tadashi
    Kato, Satoru
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2011, 7 (08): : 5003 - 5014
  • [3] A study on Japanese historical character recognition using modular neural networks
    Horiuchi, Tadashi
    Kato, Satoru
    2009 4th International Conference on Innovative Computing, Information and Control, ICICIC 2009, 2009, : 1507 - 1510
  • [4] The question on the historical Jesus and the character of historical recognition
    Schröter, J
    SAYINGS SOURCE Q AND THE HISTORICAL JESUS, 2001, 158 : 207 - 254
  • [5] Character recognition in a Japanese text recognition system
    Hong, T
    Srikantan, G
    Zandy, VC
    Fang, C
    Srihari, SN
    DOCUMENT RECOGNITION III, 1996, 2660 : 51 - 62
  • [6] Pattern recognition approaches to Japanese character recognition
    Das, Soumendu
    Banerjee, Sreeparna
    Advances in Intelligent and Soft Computing, 2012, 166 AISC (VOL. 1): : 83 - 92
  • [7] Recognition of hand writing Japanese character
    Sano, Tetsuya
    Ukida, Hiroyuki
    Yamamoto, Hideki
    IDAACS 2007: PROCEEDINGS OF THE 4TH IEEE WORKSHOP ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS, 2007, : 399 - +
  • [8] PARTS TRACKING STARTS WITH CHARACTER-RECOGNITION
    FRY, J
    I&CS-CONTROL TECHNOLOGY FOR ENGINEERS AND ENGINEERING MANAGEMENT, 1990, 63 (05): : 101 - 102
  • [9] Character Recognition in Japanese Historical Documents via Adaptive Multi-Region Model
    Wang, Yueyu
    Kamata, Sei-ichiro
    2018 JOINT 7TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2018 2ND INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2018, : 404 - 409
  • [10] Japanese Character Recognition with Microstrip Line Networks
    Novac, Marian
    Bodea, Marian-Bogdan
    Anghelescu, Petre
    Gavriloaia, Bogdan-Mihai
    Fratu, Octavian
    Gavriloaia, Mariuca
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTERS AND ARTIFICIAL INTELLIGENCE (ECAI-2019), 2019,