Offline handwritten mathematical expression recognition with graph encoder and transformer decoder

被引:6
|
作者
Tang, Jia-Man [1 ,2 ]
Guo, Hong-Yu [2 ,3 ]
Wu, Jin-Wen [2 ,3 ]
Yin, Fei [2 ,3 ]
Huang, Lin-Lin [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Elect & Informat Engn, Beijing 100044, Peoples R China
[2] Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence S, Inst Automat, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
关键词
Handwritten mathematical expression recognition; Symbol detection; Graph Neural Network; Transformer;
D O I
10.1016/j.patcog.2023.110155
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten mathematical expression recognition (H MER) has attracted extensive attention. Despite the significant progress achieved in recent years attributed to the development of deep learning approaches, HMER remains a challenge due to the complex spatial structure and variable writing styles. Encoder-decoder models with attention mechanism, which treats HMER as an image-to-sequence (i.e. LaTeX) generation task, have boosted the accuracy, but suffer from low interpretability in that the symbols are not segmented explicitly. Symbol segmentation is desired for facilitating post-processing and human interaction in real applications. In this paper, we formulate the mathematical expression as a graph and propose a Graph-Encoder-Transformer-Decoder (GETD) approach for HMER . For constructing the graph from input image, candidate symbols are first detected using an object detector and represented as the nodes of a graph, called symbol graph, and the edges of the graph encodes the between-symbol relationship. The spatial information is aggregated in a graph neural network (GNN), and a Transformer-based decoder is used to identify the symbol classes and structure from the graph. Experiments on public datasets demonstrate that our GETD model achieves competitive expression recognition performance while offering good interpretability compared with previous methods.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Multi-Scale Attention with Dense Encoder for Handwritten Mathematical Expression Recognition
    Zhang, Jianshu
    Du, Jun
    Dai, Lirong
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2245 - 2250
  • [22] Graph-to-Graph: Towards Accurate and Interpretable Online Handwritten Mathematical Expression Recognition
    Wu, Jin-Wen
    Yin, Fei
    Zhang, Yan-Ming
    Zhang, Xu-Yao
    Liu, Cheng-Lin
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2925 - 2933
  • [23] Gated Convolution and Stacked Self-Attention Encoder-Decoder-Based Model for Offline Handwritten Ethiopic Text Recognition
    Tadesse, Direselign Addis
    Liu, Chuan-Ming
    Ta, Van-Dai
    INFORMATION, 2023, 14 (12)
  • [24] CoMER: Modeling Coverage for Transformer-Based Handwritten Mathematical Expression Recognition
    Zhao, Wenqi
    Gao, Liangcai
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 392 - 408
  • [25] Offline handwritten mathematical recognition using adversarial learning and transformers
    Ujjwal Thakur
    Anuj Sharma
    International Journal on Document Analysis and Recognition (IJDAR), 2024, 27 : 147 - 158
  • [26] Offline handwritten mathematical recognition using adversarial learning and transformers
    Thakur, Ujjwal
    Sharma, Anuj
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2024, 27 (02) : 147 - 158
  • [27] AttentionHTR: Handwritten Text Recognition Based on Attention Encoder-Decoder Networks
    Kass, Dmitrijs
    Vats, Ekta
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 507 - 522
  • [28] Tree-based data augmentation and mutual learning for offline handwritten mathematical expression recognition
    Yang, Chen
    Du, Jun
    Zhang, Jianshu
    Wu, Changjie
    Chen, Mingjun
    Wu, JiaJia
    PATTERN RECOGNITION, 2022, 132
  • [29] Improvement of End-to-End Offline Handwritten Mathematical Expression Recognition by Weakly Supervised Learning
    Thanh-Nghia Truong
    Cuong Tuan Nguyen
    Khanh Minh Phan
    Nakagawa, Masaki
    2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 181 - 186
  • [30] Online handwritten mathematical expression recognition
    Buyukbayrak, Hakan
    Yanikoglu, Berrin
    Ercil, Aytul
    DOCUMENT RECOGNITION AND RETRIEVAL XIV, 2007, 6500