A Self-attention Based Model for Offline Handwritten Text Recognition

Cited by: 2
Authors
Nam Tuan Ly [1]
Trung Tan Ngo [1]
Nakagawa, Masaki [1]
Affiliations
[1] Tokyo Univ Agr & Technol, Tokyo, Japan
Keywords
Self-attention; Multi-head; Handwritten text recognition; CNN; BLSTM; CTC; SEQUENCE
DOI
10.1007/978-3-031-02444-3_27
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Offline handwritten text recognition is an important part of document analysis and has received sustained attention from researchers for decades. In this paper, we present a self-attention-based model for offline handwritten text-line recognition. The proposed model consists of three main components: a CNN feature extractor; an encoder combining a BLSTM network with a self-attention module; and a CTC decoder. The self-attention module complements the RNN in the encoder and helps it capture long-range and multi-level dependencies across an input sequence. In extensive experiments on two datasets, IAM Handwriting and Kuzushiji, the proposed model achieves better accuracy than state-of-the-art models. Visualization of the self-attention maps shows that the self-attention mechanism helps the encoder capture these long-range and multi-level dependencies.
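The two ends of the pipeline named in the abstract, self-attention over a frame sequence and CTC decoding, can be illustrated with a minimal, dependency-free sketch. This is not the authors' implementation: the CNN and BLSTM stages are omitted, the attention is single-head with no learned projections (queries, keys, and values are the input frames themselves), and the decoder is plain best-path CTC decoding. The function names and the toy inputs are hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(seq):
    """Single-head scaled dot-product self-attention.

    Assumption: queries, keys, and values are the input frames themselves
    (no learned projections), so the sketch needs no parameters.
    Returns (outputs, weights); weights[i][j] is how strongly frame i
    attends to frame j.
    """
    d = len(seq[0])
    outputs, weights = [], []
    for q in seq:
        # Similarity of this frame to every frame, scaled by sqrt(d).
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in seq]
        w = softmax(scores)
        weights.append(w)
        # Output is the attention-weighted average of all frames.
        outputs.append(
            [sum(w[j] * seq[j][c] for j in range(len(seq))) for c in range(d)]
        )
    return outputs, weights


def ctc_greedy_decode(frame_labels, blank=0):
    """Best-path CTC decoding: merge repeated labels, then drop blanks."""
    decoded = []
    prev = None
    for label in frame_labels:
        if label != prev and label != blank:
            decoded.append(label)
        prev = label
    return decoded


# Hypothetical toy input: 4 frames of 2-dimensional features.
frames = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0]]
outputs, weights = self_attention(frames)

# Hypothetical per-frame argmax labels (0 is the CTC blank symbol).
labels = ctc_greedy_decode([0, 3, 3, 0, 3, 5, 5, 0])
```

In the toy input, frame 0 attends as strongly to the distant but identical frame 2 as to itself, which is the kind of long-range dependency the abstract refers to. The decoder keeps the two separate occurrences of label 3 because a blank sits between them.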
Pages: 356-369
Page count: 14