CNN-BLSTM Model for Arabic Text Recognition in Unconstrained Captured Identity Documents

被引:0
|
作者
Ghanmi, Nabil [1 ]
Belhakimi, Amine [1 ]
Awal, Ahmad-Montaser [1 ]
机构
[1] IDNOW, AI&ML Ctr Excellence, Rennes, France
关键词
Arabic Text Recognition; Identity Document; Convolutional Neural Network; Long Short-Term Memory; Connectionsit Temporal Classification; Character Error Rate;
D O I
10.1007/978-3-031-51023-6_10
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Optical Character Recognition (OCR) for Arabic text (printed and handwritten) has been widely studied by researchers in the last two decades. Some commercial solutions have emerged with good recognition rates for printed text (on white or uniform backgrounds) or handwritten text with limited vocabulary. In addition to being naturally cursive, the Arabic language comes with additional challenges due to its calligraphy resulting in a variety of fonts and styles. In this work, recent advances in recurrent neural networks are explored for the recognition of Arabic text in identity documents captured in the wild. The unconstrained captures bring additional difficulties as the text has to be first localized before being able to recognize it. Various pre-processing steps are introduced to overcome the difficulties related to the Arabic text itself and also due to the capturing conditions. The presented approach outperforms existing solutions when evaluated using a private dataset and also using the recent MIDV2020 dataset.
引用
收藏
页码:106 / 118
页数:13
相关论文
共 50 条
  • [1] Offline Handwritten Text Recognition Using Hybrid CNN-BLSTM Network
    Namdeo, Rahul Kumar
    Gupta, Chetan
    Shrivastava, Ritu
    Proceedings - 2022 IEEE 11th International Conference on Communication Systems and Network Technologies, CSNT 2022, 2022, : 318 - 323
  • [2] Enhancing Arabic Handwritten Recognition System Based CNN-BLSTM Using Generative Adversarial Networks
    Rabi, Mouhcine
    Amrouche, Mustapha
    ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2023, PT II, 2025, 2340 : 140 - 153
  • [3] A hybrid Algorithm for Text classification Based on CNN-BLSTM with Attention
    Fu, Lei
    Yin, ZhaoXia
    Wang, Xin
    Liu, Yi
    2018 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2018, : 31 - 34
  • [4] Gender-Aware CNN-BLSTM for Speech Emotion Recognition
    Zhang, Linjuan
    Wang, Longbiao
    Dang, Jianwu
    Guo, Lili
    Yu, Qiang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 782 - 790
  • [5] AHYBRID MODEL FOR ARABIC SCRIPT RECOGNITION BASED ON CNN-CBAMAND BLSTM
    Dahbali, Mohamed
    Aboutabit, Noureddine
    Lamghari, Nidal
    JORDANIAN JOURNAL OF COMPUTERS AND INFORMATION TECHNOLOGY, 2024, 10 (03): : 294 - 305
  • [6] Speaker-Independent Speech Emotion Recognition Based on CNN-BLSTM and Multiple SVMs
    Liu, Zhen-Tao
    Xiao, Peng
    Li, Dan-Yun
    Hao, Man
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT III, 2019, 11742 : 481 - 491
  • [7] Document Recognition and Translation System for Unconstrained Arabic Documents
    Cao, Huaigu
    Chen, Jinying
    Devlin, Jacob
    Prasad, Rohit
    Natarajan, Prem
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 318 - 321
  • [8] Deep BLSTM Neural Networks for Unconstrained Continuous Handwritten Text Recognition
    Frinken, Volkmar
    Uchida, Seiichi
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 911 - 915
  • [9] Unconstrained Scene Text and Video Text Recognition for Arabic Script
    Jain, Mohit
    Mathew, Minesh
    Jawahar, C. V.
    2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2017, : 26 - 30
  • [10] Research on Entity Relationship Extraction Model of Food Public Opinion Based on CNN-BLSTM
    Wang Q.
    Wang H.
    Zuo M.
    Zhang Q.
    Wen X.
    Yuan Y.
    Journal of Food Science and Technology (China), 2021, 39 (02): : 152 - 158