Radical Similarity Based Model Optimization and Post-correction for Chinese Character Recognition

被引:0
|
作者
Han, Zhongyuan [1 ]
Du, Jun [1 ]
Ma, Jiefeng [1 ]
Hu, Pengfei [1 ]
Zhang, Zhenrong [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
关键词
Radical similarity; Chinese character recognition; Bayesian risk; NETWORK;
D O I
10.1007/978-3-031-70533-5_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Radical-based methods for Chinese character recognition (CCR) have been proven effective and offer substantial advantages. Different from character-based methods, Chinese characters are described as combinations of structures and radicals, and character recognition is achieved by the proper identifications of these components. However, there are visual similarities among radicals, leading to the ambiguity problem for CCR, which is not fully utilized in previous work. Accordingly, in this study, we first employ the stroke order information of Chinese radicals to establish a radical similarity metric. Then we improve the radical-based CCR in two ways. During the training stage, we propose a new loss function called minimum Bayesian risk (MBR) based on the radical similarity metric to yield better performance. During the recognition stage, the radical similarity is adopted to post-correct the potential error recognition results, offering a low-cost yet effective solution. Experimental results on different radical-based CCR models and datasets demonstrate the effectiveness and robustness of our proposed method.
引用
收藏
页码:152 / 168
页数:17
相关论文
共 50 条
  • [1] Optical character recognition with neural networks and post-correction with finite state methods
    Drobac, Senka
    Linden, Krister
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2020, 23 (04) : 279 - 295
  • [2] Optical character recognition with neural networks and post-correction with finite state methods
    Senka Drobac
    Krister Lindén
    International Journal on Document Analysis and Recognition (IJDAR), 2020, 23 : 279 - 295
  • [3] Fertility channel model for post-correction of continuous speech recognition
    Ringger, EK
    Allen, JF
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 897 - 900
  • [4] Correction to: Radical-based extract and recognition networks for Oracle character recognition
    Xiaoyu Lin
    Shanxiong Chen
    Fujia Zhao
    Xiaogang Qiu
    International Journal on Document Analysis and Recognition (IJDAR), 2022, 25 : 237 - 237
  • [5] On Optimization of Traditional Chinese Character Recognition
    Huang, Yanbo
    Mondal, Subrota Kumar
    Cheng, Yuning
    Wang, Chengwei
    2024 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE SERVICES ENGINEERING, SSE 2024, 2024, : 293 - 302
  • [6] Electrophysiological evidence of the character and the radical encoding in Chinese character recognition
    Chiu, Yi-Shiuan
    Su, Han-Yi
    INTERNATIONAL JOURNAL OF PSYCHOPHYSIOLOGY, 2010, 77 (03) : 307 - 308
  • [7] A Transformer-based Radical Analysis Network for Chinese Character Recognition
    Yang, Chen
    Wang, Qing
    Du, Jun
    Zhang, Jianshu
    Wu, Changjie
    Wang, Jiaming
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3714 - 3719
  • [8] Radical Region based CNN for Offline Handwritten Chinese Character Recognition
    Luo, Weike
    Kamata, Sei-Ichiro
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 542 - 547
  • [9] YoloGPT: Enhancing Chinese Character Recognition and Correction
    Yang, Sheng
    Lian, Zhanbiao
    Li, Kunyu
    Liu, Peilin
    Wu, Fengge
    Zhao, Junsuo
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT V, NLPCC 2024, 2025, 15363 : 377 - 388
  • [10] Chinese Character Recognition Based on Character Reconstruction
    Yun Li
    Mei Xie
    2009 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLUMES I & II: COMMUNICATIONS, NETWORKS AND SIGNAL PROCESSING, VOL I/ELECTRONIC DEVICES, CIRUITS AND SYSTEMS, VOL II, 2009, : 460 - 463