Handwritten text recognition and information extraction from ancient manuscripts using deep convolutional and recurrent neural network

Author
El Bahi, Hassan [1]
Affiliation
[1] L2IS, Laboratory of Computer and Systems Engineering, Cadi Ayyad University, B.P. 511, Marrakech, 40000, Morocco
Keywords
Deep neural networks - Long short-term memory - Multilayer neural networks - Palmprint recognition
DOI
10.1007/s00500-024-09930-6
Abstract
Digitizing ancient manuscripts and making them accessible to a broader audience is a crucial step in unlocking the wealth of information they hold. However, automatic recognition of handwritten text and the extraction of relevant information, such as named entities, from these manuscripts are among the most difficult research topics, owing to factors such as poor manuscript quality, complex backgrounds, ink stains, and cursive handwriting. To meet these challenges, we propose two systems. The first performs handwritten text recognition (HTR) on ancient manuscripts: after a preprocessing step, a convolutional neural network (CNN) extracts features from each input image, and a recurrent neural network (RNN) built from Long Short-Term Memory (LSTM) blocks with a Connectionist Temporal Classification (CTC) layer predicts the text contained in the image. The second system recognizes named entities and the relationships among words directly from images of old manuscripts, bypassing an intermediate text transcription step. Like the first, it starts with a preprocessing step; data augmentation is then used to enlarge the training dataset, a CNN model automatically extracts the most relevant features, and a bidirectional LSTM performs named entity recognition and identifies the relationships between word images. Extensive experiments on the ESPOSALLES dataset demonstrate that the proposed systems achieve state-of-the-art performance, exceeding existing systems. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
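The CTC layer mentioned in the abstract emits one label distribution per image column (time step), including a special blank label. The abstract does not say how the final text is read out, so the following is only a minimal sketch of best-path (greedy) CTC decoding, one common choice, not the author's implementation. The function name `ctc_greedy_decode`, the toy alphabet, the blank index 0, and the score values are all illustrative assumptions.

```python
from itertools import groupby

BLANK = 0  # assumed index of the CTC blank label (illustrative choice)


def ctc_greedy_decode(frame_scores, alphabet, blank=BLANK):
    """Best-path CTC decoding: argmax per frame, merge repeats, drop blanks."""
    # Pick the highest-scoring label index at every time step.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]
    # Merge runs of identical labels, then discard the blank label.
    collapsed = [label for label, _ in groupby(best_path)]
    return "".join(alphabet[label] for label in collapsed if label != blank)


# Hypothetical toy alphabet: index 0 is the blank, 1 is "a", 2 is "n".
ALPHABET = "-an"

# Hypothetical per-frame scores, as an LSTM+CTC output layer might produce.
frames = [
    [0.10, 0.80, 0.10],  # a
    [0.20, 0.70, 0.10],  # a (repeat, merged)
    [0.90, 0.05, 0.05],  # blank
    [0.10, 0.10, 0.80],  # n
    [0.10, 0.20, 0.70],  # n (repeat, merged)
    [0.80, 0.10, 0.10],  # blank separates the two n's
    [0.20, 0.10, 0.70],  # n
    [0.10, 0.60, 0.30],  # a
]

decoded = ctc_greedy_decode(frames, ALPHABET)
print(decoded)
```

The blank between the two runs of "n" is what lets the decoder keep a doubled letter; without it, merging repeats would collapse them into one.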
Pages: 12249 - 12268
Number of pages: 19