A Self-attention Based Model for Offline Handwritten Text Recognition

被引:2
|
作者
Nam Tuan Ly [1 ]
Trung Tan Ngo [1 ]
Nakagawa, Masaki [1 ]
机构
[1] Tokyo Univ Agr & Technol, Tokyo, Japan
来源
关键词
Self-attention; Multi-head; Handwritten text recognition; CNN; BLSTM; CTC; SEQUENCE;
D O I
10.1007/978-3-031-02444-3_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Offline handwritten text recognition is an important part of document analysis and it has been receiving a lot of attention from numerous researchers for decades. In this paper, we present a self-attention-based model for offline handwritten textline recognition. The proposed model consists of three main components: a feature extractor by CNN; an encoder by a BLSTM network and a self-attention module; and a decoder by CTC. The self-attention module is complementary to RNN in the encoder and helps the encoder to capture long-range and multi-level dependencies across an input sequence. According to the extensive experiments on the two datasets of IAM Handwriting and Kuzushiji, the proposed model achieves better accuracy than the state-of-the-art models. The self-attention map visualization shows that the self-attention mechanism helps the encoder capture long-range and multi-level dependencies across an input sequence.
引用
下载
收藏
页码:356 / 369
页数:14
相关论文
共 50 条
  • [1] Gated Convolution and Stacked Self-Attention Encoder-Decoder-Based Model for Offline Handwritten Ethiopic Text Recognition
    Tadesse, Direselign Addis
    Liu, Chuan-Ming
    Ta, Van-Dai
    INFORMATION, 2023, 14 (12)
  • [2] 2D Self-attention Convolutional Recurrent Network for Offline Handwritten Text Recognition
    Ly, Nam Tuan
    Nguyen, Hung Tuan
    Nakagawa, Masaki
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT I, 2021, 12821 : 191 - 204
  • [3] Offline Handwritten Text Recognition Based on CTC-Attention
    Ma Yangyang
    Xiao Bing
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (12)
  • [4] Self-attention Networks for Non-recurrent Handwritten Text Recognition
    d'Arce, Rafael
    Norton, Terence
    Hannuna, Sion
    Cristianini, Nello
    FRONTIERS IN HANDWRITING RECOGNITION, ICFHR 2022, 2022, 13639 : 389 - 403
  • [5] An attention based method for offline handwritten L rdu text recognition
    Anjum, Tayaba
    Khan, Nazar
    2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 169 - 174
  • [6] Offline Recognition of Malayalam Handwritten Text
    Shanjana, C.
    James, Ajay
    8TH INTERNATIONAL CONFERENCE INTERDISCIPLINARITY IN ENGINEERING, INTER-ENG 2014, 2015, 19 : 772 - 779
  • [7] A Text Sentiment Analysis Model Based on Self-Attention Mechanism
    Ji, Likun
    Gong, Ping
    Yao, Zhuyu
    2019 THE 3RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPILATION, COMPUTING AND COMMUNICATIONS (HP3C 2019), 2019, : 33 - 37
  • [8] Optimizing the integration of a statistical language model in HMM based offline handwritten text recognition
    Zimmermann, M
    Bunke, H
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 541 - 544
  • [9] HANA: A handwritten name database for offline handwritten text recognition
    Dahl, Christian M.
    Johansen, Torben S. D.
    Sorensen, Emil N.
    Wittrock, Simon
    EXPLORATIONS IN ECONOMIC HISTORY, 2023, 87
  • [10] A Residual-Attention Offline Handwritten Chinese Text Recognition Based on Fully Convolutional Neural Networks
    Wang, Yintong
    Yang, Yingjie
    Ding, Weiping
    Li, Shuo
    IEEE ACCESS, 2021, 9 : 132301 - 132310