High Performance Urdu and Arabic Video Text Recognition Using Convolutional Recurrent Neural Networks

Cited by: 2
Authors
Rehman, Abdul [1]
Ul-Hasan, Adnan [2]
Shafait, Faisal [1, 2]
Affiliations
[1] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci, Islamabad, Pakistan
[2] Natl Ctr Artificial Intelligence, Deep Learning Lab, Lahore, Pakistan
Keywords
Urdu; Arabic; Video text recognition; CRNN
DOI
10.1007/978-3-030-86198-8_24
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Text extraction from videos is an emerging research field in the document analysis community. We propose a simple Convolutional Recurrent Neural Network to perform text recognition on both Arabic and Urdu scripts. We use a large variety of data augmentation techniques to generalize the model and prevent over-fitting. We also use a slightly improved loss function that helps the model converge faster. Using the proposed method we achieved 99.73% CRR, 88.37% WRR and 89.92% LRR on the Urdu Ticker Text dataset and 96.82% CRR, 90.41% WRR and 76.78% LRR on the AcTiVComp20 dataset. The proposed method has significantly outperformed Google Vision API on both of the datasets.
Pages: 336-352
Number of pages: 17
Related papers
50 in total
  • [1] Ligature Recognition in Urdu Caption Text using Deep Convolutional Neural Networks
    Hayat, Umar
    Aatif, Muhammad
    Zeeshan, Osama
    Siddiqi, Imran
    [J]. 2018 14TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET), 2018,
  • [2] Arabic Video Text Recognition Based on Multi-Dimensional Recurrent Neural Networks
    Zayene, Oussama
    Amamou, Soumaya Essefi
    BenAmara, Najoua Essoukri
    [J]. 2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2017, : 725 - 729
  • [3] Recognition of printed Urdu ligatures using convolutional neural networks
    Uddin, Israr
    Javed, Nizwa
    Siddiqi, Imran
    Khalid, Shehzad
    Khurshid, Khurram
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (03)
  • [4] Robust Arabic Text Categorization by Combining Convolutional and Recurrent Neural Networks
    Ameur, Mohamed Seghir Hadj
    Belkebir, Riadh
    Guessoum, Ahmed
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (05)
  • [5] Arabic Text Generation Using Recurrent Neural Networks
    Souri, Adnan
    El Maazouzi, Zakaria
    Al Achhab, Mohammed
    Eddine El Mohajir, Badr
    [J]. BIG DATA, CLOUD AND APPLICATIONS, BDCA 2018, 2018, 872 : 523 - 533
  • [6] Recognition of printed Arabic text using neural networks
    Amin, A
    Mansoor, W
    [J]. PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, 1997, : 612 - 615
  • [7] Urdu Natural Scene Character Recognition using Convolutional Neural Networks
    Ali, Asghar
    Pickering, Mark
    Shafi, Kamran
    [J]. 2018 IEEE 2ND INTERNATIONAL WORKSHOP ON ARABIC AND DERIVED SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2018, : 29 - 34
  • [8] Arabic speech recognition using recurrent neural networks
    El Choubassi, MM
    El Khoury, HE
    Alagha, CEJ
    Skaf, JA
    Al-Alaoui, MA
    [J]. PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 543 - 547
  • [9] Deep sentiments in Roman Urdu text using Recurrent Convolutional Neural Network model
    Mahmood, Zainab
    Safder, Iqra
    Nawab, Rao Muhammad Adeel
    Bukhari, Faisal
    Nawaz, Raheel
    Alfakeeh, Ahmed S.
    Aljohani, Naif Radi
    Hassan, Saeed-Ul
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (04)
  • [10] Automatic diacritization of Arabic text using recurrent neural networks
    Gheith A. Abandah
    Alex Graves
    Balkees Al-Shagoor
    Alaa Arabiyat
    Fuad Jamour
    Majid Al-Taee
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2015, 18 : 183 - 197