Recognizing handwritten Arabic words using grapheme segmentation and recurrent neural networks

被引:0
|
作者
Gheith A. Abandah
Fuad T. Jamour
Esam A. Qaralleh
机构
[1] The University of Jordan,Computer Engineering Department
[2] King Abdullah University of Science and Technology,undefined
[3] Princess Sumaya University for Technology,undefined
关键词
Optical character recognition; Handwritten Arabic words; Segmentation; Feature evaluation and selection; Recurrent neural networks;
D O I
暂无
中图分类号
学科分类号
摘要
The Arabic alphabet is used in around 27 languages, including Arabic, Persian, Kurdish, Urdu, and Jawi. Many researchers have developed systems for recognizing cursive handwritten Arabic words, using both holistic and segmentation-based approaches. This paper introduces a system that achieves high accuracy using efficient segmentation, feature extraction, and recurrent neural network (RNN). We describe a robust rule-based segmentation algorithm that uses special feature points identified in the word skeleton to segment the cursive words into graphemes. We show that careful selection from a wide range of features extracted during and after the segmentation stage produces a feature set that significantly reduces the label error. We demonstrate that using same RNN recognition engine, the segmentation approach with efficient feature extraction gives better results than a holistic approach that extracts features from raw pixels. We evaluated this segmentation approach against an improved version of the holistic system MDLSTM that won the ICDAR 2009 Arabic handwritten word recognition competition. On the IfN/ENIT database of handwritten Arabic words, the segmentation approach reduces the average label error by 18.5 %, the sequence error by 22.3 %, and the execution time by 31 %, relative to MDLSTM. This approach also has the best published accuracies on two IfN/ENIT test sets.
引用
收藏
页码:275 / 291
页数:16
相关论文
共 50 条
  • [41] Handwritten word recognition using Web resources and recurrent neural networks
    Oprean, Cristina
    Likforman-Sulem, Laurence
    Popescu, Adrian
    Mokbel, Chafic
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2015, 18 (04) : 287 - 301
  • [42] Recognition of Arabic handwritten words using contextual character models
    El-Hajj, Ramy
    Mokbel, Chafic
    Ukforman-Sulm, Laurence
    [J]. DOCUMENT RECOGNITION AND RETRIEVAL XV, 2008, 6815
  • [43] Lip Reading Filipino Words Using Convolutional Neural Networks and Recurrent Neural Networks
    Crespo, Steven David A.
    Recario, Reginald Neil C.
    [J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, INTELLISYS 2023, 2024, 823 : 771 - 790
  • [44] Recognizing arabic handwritten characters using deep learning and genetic algorithms
    Balaha, Hossam Magdy
    Ali, Hesham Arafat
    Youssef, Esraa Khaled
    Elsayed, Asmaa Elsayed
    Samak, Reem Adel
    Abdelhaleem, Mohammed Samy
    Tolba, Mohammed Mosa
    Shehata, Mahmoud Ragab
    Mahmoud, Mahmoud Refa'at
    Abdelhameed, Mariam Mahmoud
    Mohammed, Mostafa Mahmoud
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (21-23) : 32473 - 32509
  • [45] Deep Neural Networks for Recognizing Online Handwritten Mathematical Symbols
    Nguyen, Hai Dai
    Le, Anh Duc
    Nakagawa, Masaki
    [J]. PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 121 - 125
  • [46] Recognizing Arabic Handwritten Script using Support Vector Machine classifier
    Elleuch, Mohamed
    Lahiani, Houssem
    Kherallah, Monji
    [J]. 2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 551 - 556
  • [47] Segmentation of Handwritten Arabic Graphemes Using a Directed Convolutional Neural Network and Mathematical Morphology Operations
    Elkhayati, Mohsine
    Elkettani, Youssfi
    Mourchid, Mohammed
    [J]. PATTERN RECOGNITION, 2022, 122
  • [48] Recognizing handwritten Single Digits and Digit Strings Using Deep Architecture of Neural Networks
    Saabni, Raid
    [J]. 2016 THIRD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION (AIPR), 2016,
  • [49] Recognizing arabic handwritten characters using deep learning and genetic algorithms
    Hossam Magdy Balaha
    Hesham Arafat Ali
    Esraa Khaled Youssef
    Asmaa Elsayed Elsayed
    Reem Adel Samak
    Mohammed Samy Abdelhaleem
    Mohammed Mosa Tolba
    Mahmoud Ragab Shehata
    Mahmoud Refa’at Mahmoud
    Mariam Mahmoud Abdelhameed
    Mostafa Mahmoud Mohammed
    [J]. Multimedia Tools and Applications, 2021, 80 : 32473 - 32509
  • [50] LOW-RESOURCE GRAPHEME-TO-PHONEME CONVERSION USING RECURRENT NEURAL NETWORKS
    Jyothi, Preethi
    Hasegawa-Johnson, Mark
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5030 - 5034