Handwritten text recognition and information extraction from ancient manuscripts using deep convolutional and recurrent neural network

被引:0
|
作者
El Bahi, Hassan [1 ]
机构
[1] L2IS, Laboratory of Computer and Systems Engineering, Cadi Ayyad University, B.P. 511, Marrakech,40000, Morocco
关键词
Deep neural networks - Long short-term memory - Multilayer neural networks - Palmprint recognition;
D O I
10.1007/s00500-024-09930-6
中图分类号
学科分类号
摘要
Digitizing ancient manuscripts and making them accessible to a broader audience is a crucial step in unlocking the wealth of information they hold. However, automatic recognition of handwritten text and the extraction of relevant information such as named entities from these manuscripts are among the most difficult research topics, due to several factors such as poor quality of manuscripts, complex background, presence of ink stains, cursive handwriting, etc. To meet these challenges, we propose two systems, the first system performs the task of handwritten text recognition (HTR) in ancient manuscripts; it starts with a preprocessing operation. Then, a convolutional neural network (CNN) is used to extract the features of each input image. Finally, a recurrent neural network (RNN) which has Long Short-Term Memory (LSTM) blocks with the Connectionist Temporal Classification (CTC) layer will predict the text contained in the image. The second system focuses on recognizing named entities and deciphering the relationships among words directly from images of old manuscripts, bypassing the need for an intermediate text transcription step. Like the previous system, this second system starts with a preprocessing step. Then the data augmentation technique is used to increase the training dataset. After that, the extraction of the most relevant features is done automatically using a CNN model. Finally, the recognition of names entities and the relationship between word images is performed using a bidirectional LSTM. Extensive experiments on the ESPOSALLES dataset demonstrate that the proposed systems achieve the state-of-the-art performance exceeding existing systems. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
引用
收藏
页码:12249 / 12268
页数:19
相关论文
共 50 条
  • [41] Arabic Handwritten Characters Recognition Using Convolutional Neural Network
    AlJarrah, Mohammed N.
    Zyout, Mo'ath M.
    Duwairi, Rehab
    2021 12TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2021, : 182 - 188
  • [42] Recognition of Handwritten Devanagari Character using Convolutional Neural Network
    Dokare, Indu
    Gadge, Siddhesh
    Kharde, Kedar
    Bhere, Siddhesh
    Jadhav, Rohit
    ICSPC'21: 2021 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICPSC), 2021, : 353 - 359
  • [43] Arabic Handwritten Characters Recognition using Convolutional Neural Network
    Najadat, Hassan M.
    Alshboul, Ahmad A.
    Alabed, Abdullah F.
    2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2019, : 147 - 151
  • [44] Handwritten Mathematical Expression Recognition Using Convolutional Neural Network
    Giang-Son Tran
    Chi-Kien Huynh
    Thanh-Sach Le
    Tan-Phuc Phan
    Khanh-Ngoc Bui
    2018 3RD INTERNATIONAL CONFERENCE ON CONTROL, ROBOTICS AND CYBERNETICS (CRC), 2018, : 15 - 19
  • [45] Handwritten Arabic numerals recognition using convolutional neural network
    Ahamed, Pratik
    Kundu, Soumyadeep
    Khan, Tauseef
    Bhateja, Vikrant
    Sarkar, Ram
    Mollah, Ayatullah Faruk
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (11) : 5445 - 5457
  • [46] Text recognition in document images obtained by a smartphone based on deep convolutional and recurrent neural network
    El Bahi, Hassan
    Zatni, Abdelkarim
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (18) : 26453 - 26481
  • [47] Text recognition in document images obtained by a smartphone based on deep convolutional and recurrent neural network
    Hassan El Bahi
    Abdelkarim Zatni
    Multimedia Tools and Applications, 2019, 78 : 26453 - 26481
  • [48] Handwritten Chinese Text Recognition Using Separable Multi-Dimensional Recurrent Neural Network
    Wu, Yi-Chao
    Yin, Fei
    Chen, Zhuo
    Liu, Cheng-Lin
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 79 - 84
  • [49] Recurrent Neural Network Transducer for Japanese and Chinese Offline Handwritten Text Recognition
    Ngo, Trung Tan
    Nguyen, Hung Tuan
    Ly, Nam Tuan
    Nakagawa, Masaki
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT II, 2021, 12917 : 364 - 376
  • [50] A Convolutional Neural Network for Handwritten Digit Recognition
    Guevara Neri, Maria Cristina
    Vergara Villegas, Osslan Osiris
    Cruz Sanchez, Vianey Guadalupe
    Nandayapa, Manuel
    Sossa Azuela, Juan Humberto
    INTERNATIONAL JOURNAL OF COMBINATORIAL OPTIMIZATION PROBLEMS AND INFORMATICS, 2020, 11 (01): : 97 - 105