Recognizing handwritten Arabic words using grapheme segmentation and recurrent neural networks

被引:23
|
作者
Abandah, Gheith A. [1 ]
Jamour, Fuad T. [2 ]
Qaralleh, Esam A. [3 ]
机构
[1] Univ Jordan, Dept Comp Engn, Amman 11942, Jordan
[2] King Abdullah Univ Sci & Technol, Thuwal, Saudi Arabia
[3] Princess Sumaya Univ Technol, Amman, Jordan
关键词
Optical character recognition; Handwritten Arabic words; Segmentation; Feature evaluation and selection; Recurrent neural networks; CHARACTER-RECOGNITION; FEATURE-SELECTION; ALGORITHM;
D O I
10.1007/s10032-014-0218-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Arabic alphabet is used in around 27 languages, including Arabic, Persian, Kurdish, Urdu, and Jawi. Many researchers have developed systems for recognizing cursive handwritten Arabic words, using both holistic and segmentation-based approaches. This paper introduces a system that achieves high accuracy using efficient segmentation, feature extraction, and recurrent neural network (RNN). We describe a robust rule-based segmentation algorithm that uses special feature points identified in the word skeleton to segment the cursive words into graphemes. We show that careful selection from a wide range of features extracted during and after the segmentation stage produces a feature set that significantly reduces the label error. We demonstrate that using same RNN recognition engine, the segmentation approach with efficient feature extraction gives better results than a holistic approach that extracts features from raw pixels. We evaluated this segmentation approach against an improved version of the holistic system MDLSTM that won the ICDAR 2009 Arabic handwritten word recognition competition. On the IfN/ENIT database of handwritten Arabic words, the segmentation approach reduces the average label error by 18.5 %, the sequence error by 22.3 %, and the execution time by 31 %, relative to MDLSTM. This approach also has the best published accuracies on two IfN/ENIT test sets.
引用
收藏
页码:275 / 291
页数:17
相关论文
共 50 条
  • [1] Recognizing handwritten Arabic words using grapheme segmentation and recurrent neural networks
    Gheith A. Abandah
    Fuad T. Jamour
    Esam A. Qaralleh
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2014, 17 : 275 - 291
  • [2] Recognizing Handwritten Devanagari Words Using Recurrent Neural Network
    Oval, Sonali G.
    Shirawale, Sankirti
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 2, 2015, 328 : 413 - 421
  • [3] Combining convolutional neural networks with SVM classifier for recognizing Persian and Arabic handwritten words
    Shahram Golzari
    Abdullah Khalili
    Rasool Sabzi
    [J]. Multimedia Tools and Applications, 2022, 81 : 33785 - 33799
  • [4] Combining convolutional neural networks with SVM classifier for recognizing Persian and Arabic handwritten words
    Golzari, Shahram
    Khalili, Abdullah
    Sabzi, Rasool
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 33785 - 33799
  • [5] Recognizing Arabic handwritten words using multiple features and classifier selection
    Aiadi, Oussama
    Korichi, Aicha
    Kherfi, Mohammed Lamine
    [J]. 2019 4TH INTERNATIONAL CONFERENCE ON NETWORKING AND ADVANCED SYSTEMS (ICNAS 2019), 2019, : 106 - 110
  • [6] Recognizing Persian Handwritten Words Using Deep Convolutional Networks
    Sabzi, Rasool
    Fotoohinya, Zahra
    Salkhorde, Zeinab
    Khalili, Abdullah
    Golzari, Shahram
    Behravesh, Sajjad
    Akbarpour, Shahin
    [J]. 2017 19TH CSI INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2017, : 85 - 90
  • [7] An Improved Segmentation of Online English Handwritten Text Using Recurrent Neural Networks
    Cuong Tuan Nguyen
    Nakagawa, Masaki
    [J]. PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 176 - 180
  • [8] Recognition of Persian/Arabic Handwritten Words Using a Combination of Convolutional Neural Networks and Autoencoder (AECNN)
    Khosravi, Sara
    Chalechale, Abdolah
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [9] Chimp Optimization Algorithm to Optimize a Convolutional Neural Network for Recognizing Persian/Arabic Handwritten Words
    Khosravi, Sara
    Chalechale, Abdolah
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [10] Chimp Optimization Algorithm to Optimize a Convolutional Neural Network for Recognizing Persian/Arabic Handwritten Words
    Khosravi, Sara
    Chalechale, Abdolah
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022