Characters as graphs: Interpretable handwritten Chinese character recognition via Pyramid Graph Transformer

被引:9
|
作者
Gan, Ji [1 ,2 ]
Chen, Yuyan [1 ]
Hu, Bo [1 ,2 ]
Leng, Jiaxu [1 ,2 ]
Wang, Weiqiang [3 ]
Gao, Xinbo [1 ,2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing, Peoples R China
[2] Chongqing Inst Brain & Intelligence, Guangyang Bay Lab, Chongqing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Handwritten Chinese character Recognition; Transformer; Graph convolutional network; Pyramid graph; ONLINE; REPRESENTATION; EXTRACTION;
D O I
10.1016/j.patcog.2023.109317
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is meaningful but challenging to teach machines to recognize handwritten Chinese characters. However, conventional approaches typically view handwritten Chinese characters as either static images or tempo-ral trajectories, which may ignore the inherent geometric semantics of characters. Instead, here we first propose to represent handwritten characters as skeleton graphs, explicitly considering the natural charac-teristics of characters (i.e., characters as graphs). Furthermore, we propose a novel Pyramid Graph Trans-former (PyGT) to specifically process the graph-structured characters, which fully integrates the advan-tages of Transformers and graph convolutional networks. Specifically, our PyGT can learn better graph fea-tures through (i) capturing the global information from all nodes with graph attention mechanism and (ii) modelling the explicit local adjacency structures of nodes with graph convolutions. Furthermore, the PyGT learns the multi-resolution features by constructing a progressive shrinking pyramid. Compared with ex-isting approaches, it is more interpretable to recognize characters as geometric graphs. Moreover, the pro-posed method is generic for both online and offline handwritten Chinese character recognition (HCCR), and it also can be feasibly extended to handwritten text recognition. Extensive experiments empirically demonstrate the superiority of PyGT over the prevalent approaches including 2D-CNN, RNN/1D-CNN, and Vision Transformer (ViT) for HCCR. The code is available at https://github.com/ganji15/PyGT-HCCR .& COPY; 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] The recognition of handwritten Chinese characters from paper records
    Loudon, G
    Hong, C
    Wu, YM
    Zitserman, R
    1996 IEEE TENCON - DIGITAL SIGNAL PROCESSING APPLICATIONS PROCEEDINGS, VOLS 1 AND 2, 1996, : 923 - 926
  • [42] Recognition of handwritten similar Chinese characters by neural networks
    Fu, HC
    Chen, JM
    NEURAL NETWORKS FOR SIGNAL PROCESSING VI, 1996, : 320 - 329
  • [43] Handwritten Character String Recognition Using Transformer and CNN Features
    Rakuka, Shunya
    Morita, Kento
    Wakabayashi, Tetsushi
    2024 Joint 13th International Conference on Soft Computing and Intelligent Systems and 25th International Symposium on Advanced Intelligent Systems, SCIS and ISIS 2024, 2024,
  • [44] Multiple candidate characters in the post-processing for off-line handwritten chinese character recognition
    Li, YX
    Ding, XQ
    2001 INTERNATIONAL CONFERENCES ON INFO-TECH AND INFO-NET PROCEEDINGS, CONFERENCE A-G: INFO-TECH & INFO-NET: A KEY TO BETTER LIFE, 2001, : C438 - C443
  • [45] Graph-to-Graph: Towards Accurate and Interpretable Online Handwritten Mathematical Expression Recognition
    Wu, Jin-Wen
    Yin, Fei
    Zhang, Yan-Ming
    Zhang, Xu-Yao
    Liu, Cheng-Lin
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2925 - 2933
  • [46] An integration approach to handwritten Chinese character recognition system
    Hongwei Hao
    Ruwei Dai
    Science in China Series E: Technological Sciences, 1998, 41 : 101 - 105
  • [47] Markov random fields for handwritten Chinese character recognition
    Zeng, J
    Liu, ZQ
    EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 101 - 105
  • [48] Deep Matching Network for Handwritten Chinese Character Recognition
    Li, Zhiyuan
    Wu, Qi
    Xiao, Yi
    Jin, Min
    Lu, Huaxiang
    Pattern Recognition, 2020, 107
  • [49] Deep Neural Networks for Handwritten Chinese Character Recognition
    Maidana, Renan G.
    Monteiro, Juarez
    Granada, Roger
    Amory, Alexandre M.
    Barros, Rodrigo C.
    2017 6TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2017, : 192 - 197
  • [50] A character image restoration method for unconstrained handwritten Chinese character recognition
    Yunxue Shao
    Chunheng Wang
    Baihua Xiao
    International Journal on Document Analysis and Recognition (IJDAR), 2015, 18 : 73 - 86