High Performance Urdu and Arabic Video Text Recognition Using Convolutional Recurrent Neural Networks

被引:2
|
作者
Rehman, Abdul [1 ]
Ul-Hasan, Adnan [2 ]
Shafait, Faisal [1 ,2 ]
机构
[1] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci, Islamabad, Pakistan
[2] Natl Ctr Artificial Intelligence, Deep Learning Lab, Lahore, Pakistan
关键词
Urdu; Arabic; Video text recognition; CRNN;
D O I
10.1007/978-3-030-86198-8_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text extraction from videos is an emerging research field in the document analysis community. We propose a simple Convolutional Recurrent Neural Network to perform text recognition on both Arabic and Urdu scripts. We use a large variety of data augmentation techniques to generalize the model and prevent over-fitting. We also use a slightly improved loss function that helps the model converge faster. Using the proposed method we achieved 99.73% CRR, 88.37% WRR and 89.92% LRR on the Urdu Ticker Text dataset and 96.82% CRR, 90.41% WRR and 76.78% LRR on the AcTiVComp20 dataset. The proposed method has significantly outperformed Google Vision API on both of the datasets.
引用
下载
收藏
页码:336 / 352
页数:17
相关论文
共 50 条
  • [41] SCENE TEXT RECOGNITION WITH DEEPER CONVOLUTIONAL NEURAL NETWORKS
    Zhang, Yuqi
    Wang, Wei
    Wang, Liang
    Wang, Liuan
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 2384 - 2388
  • [42] Offline cursive Urdu-Nastaliq script recognition using multidimensional recurrent neural networks
    Naz, Saeeda
    Umar, Arif I.
    Ahmad, Riaz
    Ahmed, Saad B.
    Shirazi, Syed H.
    Siddiqi, Imran
    Razzak, Muhammad I.
    NEUROCOMPUTING, 2016, 177 : 228 - 241
  • [43] DIRECTION FINDING USING CONVOLUTIONAL NEURAL NETWORKS and CONVOLUTIONAL RECURRENT NEURAL NETWORKS
    Uckun, Fehmi Ayberk
    Ozer, Hakan
    Nurbas, Ekin
    Onat, Emrah
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [44] Recognition of Urdu Handwritten Alphabet Using Convolutional Neural Network (CNN)
    Ahmed, Gulzar
    Alyas, Tahir
    Iqbal, Muhammad Waseem
    Ashraf, Muhammad Usman
    Alghamdi, Ahmed Mohammed
    Bahaddad, Adel A.
    Almarhabi, Khalid Ali
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02): : 2967 - 2984
  • [45] Scene Text Script Identification with Convolutional Recurrent Neural Networks
    Mei, Jieru
    Dai, Luo
    Shi, Baoguang
    Bai, Xiang
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 4053 - 4058
  • [46] PERFORMANCE OF MLP AND RBF NEURAL NETWORKS ON ARABIC TEXT CATEGORIZATION USING SVD
    Harrag, Fouzi
    Hamdi-Cherif, Aboubekeur
    El-Qawasmeh, Eyas
    NEURAL NETWORK WORLD, 2010, 20 (04) : 441 - 459
  • [47] Protein Fold Recognition From Sequences Using Convolutional and Recurrent Neural Networks
    Villegas-Morcillo, Amelia
    Gomez, Angel M.
    Morales-Cordovilla, Juan A.
    Sanchez, Victoria
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (06) : 2848 - 2854
  • [48] Object Recognition using Cellular Simultaneous Recurrent Networks and Convolutional Neural Network
    Alom, Md Zahangir
    Alam, M.
    Taha, Tarek M.
    Iftekharuddin, K. M.
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 2873 - 2880
  • [49] Multi-Domain Deep Convolutional Neural Network for Ancient Urdu Text Recognition System
    Aarif, K. O. Mohammed
    Sivakumar, P.
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 33 (01): : 275 - 289
  • [50] Investigating Hybrid Approaches for Arabic Text Diacritization with Recurrent Neural Networks
    Alqudah, Saba'
    Abandah, Gheith
    Arabiyat, Alaa
    2017 IEEE JORDAN CONFERENCE ON APPLIED ELECTRICAL ENGINEERING AND COMPUTING TECHNOLOGIES (AEECT), 2017,