Japanese historical character recognition by focusing on character parts

Cited by: 0

Authors:

Ishikawa, Takuru [1]

Miyazaki, Tomo [1]

Omachi, Shinichiro [1]

Affiliations:

[1] Tohoku Univ, Grad Sch Engn, 6-6-05 Aoba Aramakiaza, Sendai, Miyagi 9808579, Japan

Source:

PATTERN RECOGNITION | 2024 / Vol. 148

Keywords:

Historical document analysis; Japanese historical character; Learning character parts; Few-shot; Zero-shot recognition;

DOI:

10.1016/j.patcog.2023.110181

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Japanese historical documents provide valuable information, and character recognition is a critical technology for digitizing them. Sample imbalance is a significant obstacle in recognizing Japanese historical characters (kuzushiji): thousands of kuzushiji have very few samples, so recognition performance deteriorates greatly for them. In this study, we propose a framework for transferring knowledge of character parts from fonts to kuzushiji. Pretraining learns character parts from synthesized font images. Fine-tuning to kuzushiji, however, is more complex. We propose calculating a mean squared error loss between the feature vectors of kuzushiji and font images, which makes the feature vectors consistent across kuzushiji and fonts. Consequently, we can perform zero-shot recognition using the font images of zero-sampled kuzushiji. The experimental results show that the proposed method recognized zero-sampled kuzushiji at approximately 48% accuracy, which significantly expands the number of recognizable kuzushiji.
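The two ideas in the abstract, a mean squared error loss between kuzushiji and font feature vectors and zero-shot recognition from font images, can be illustrated with a minimal Python sketch. The abstract does not describe the network or the decision rule, so everything here is an assumption for illustration: the function names `mse_loss` and `zero_shot_classify` are hypothetical, the feature vectors are made-up toy values, and nearest-font matching is one plausible way to use consistent features, not necessarily the authors' procedure.

```python
from typing import Dict, Sequence


def mse_loss(kuzushiji_feat: Sequence[float], font_feat: Sequence[float]) -> float:
    """Mean squared error between two feature vectors of equal length."""
    if len(kuzushiji_feat) != len(font_feat):
        raise ValueError("feature vectors must have the same length")
    squared_diffs = [(a - b) ** 2 for a, b in zip(kuzushiji_feat, font_feat)]
    return sum(squared_diffs) / len(squared_diffs)


def zero_shot_classify(
    query_feat: Sequence[float],
    font_features: Dict[str, Sequence[float]],
) -> str:
    """Return the label whose font feature vector is closest to the query.

    Assumption: if kuzushiji and font features are consistent, a class with
    no kuzushiji samples can still be matched through its font image alone.
    """
    return min(font_features, key=lambda label: mse_loss(query_feat, font_features[label]))


# Toy values only; a real system would take these from a trained encoder.
font_features = {
    "あ": [1.0, 0.0, 0.0],
    "い": [0.0, 1.0, 0.0],
    "う": [0.0, 0.0, 1.0],
}
query = [0.1, 0.8, 0.2]  # hypothetical feature of an unseen kuzushiji image

predicted = zero_shot_classify(query, font_features)
loss = mse_loss([1.0, 2.0], [1.0, 4.0])
print(predicted, loss)
```

In this sketch, minimizing `mse_loss` during fine-tuning would pull each kuzushiji feature toward the feature of its font counterpart, which is what makes the font-only lookup in `zero_shot_classify` meaningful.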


Pages: 8
