Unconstrained OCR for Urdu using Deep CNN-RNN Hybrid Networks

被引:14
|
作者
Jain, Mohit [1 ]
Mathew, Minesh [1 ]
Jawahar, C. V. [1 ]
机构
[1] IIIT Hyderabad, Ctr Visual Informat Technol, Hyderabad, India
关键词
OCR; Urdu OCR; Deep Learning; Urdu Dataset; Text Recognition;
D O I
10.1109/ACPR.2017.5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Building robust text recognition systems for languages with cursive scripts like Urdu has always been challenging. Intricacies of the script and the absence of ample annotated data further act as adversaries to this task. We demonstrate the effectiveness of an end-to-end trainable hybrid CNN-RNN architecture in recognizing Urdu text from printed documents, typically known as Urdu OCR. The solution proposed is not bounded by any language specific lexicon with the model following a segmentation-free, sequence-to-sequence transcription approach. The network transcribes a sequence of convolutional features from an input image to a sequence of target labels. This discards the need to segment the input image into its constituent characters/glyphs, which is often arduous for scripts like Urdu. Furthermore, past and future contexts modelled by bidirectional recurrent layers aids the transcription. We outperform previous state-of-the-art techniques on the synthetic UPTI dataset. Additionally, we publish a new dataset curated by scanning printed Urdu publications in various writing styles and fonts, annotated at the line level. We also provide benchmark results of our model on this dataset.
引用
下载
收藏
页码:747 / 752
页数:6
相关论文
共 50 条
  • [11] Human Abnormality Classification Using Combined CNN-RNN Approach
    Kabir, Mohsin
    Safir, Farisa Benta
    Shahen, Saifullah
    Maua, Jannatul
    Awlad, Iffat Ara Binte
    Mridha, M. F.
    2020 IEEE 17TH INTERNATIONAL CONFERENCE ON SMART COMMUNITIES: IMPROVING QUALITY OF LIFE USING ICT, IOT AND AI (IEEEHONET 2020), 2020, : 204 - 208
  • [12] Handwritten Odia numeral recognition using combined CNN-RNN
    Das, Abhishek
    Mohanty, Mihir Narayan
    INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2023, 14 (04) : 382 - 388
  • [13] Improving Hate Speech Detection Accuracy using Hybrid CNN-RNN and Random Oversampling Techniques
    Riyadi, Slamet
    Andriyani, Annisa Divayu
    Masyhur, Ahmad Musthafa
    2024 IEEE SYMPOSIUM ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ISIEA 2024, 2024,
  • [14] Categorization of actions in soccer videos using a CNN-RNN architecture
    Macedo, Matheus de Sousa
    Adamatti, Diana Francisca
    REVISTA BRASILEIRA DE COMPUTACAO APLICADA, 2023, 15 (03): : 1 - 14
  • [15] Biomedical Named Entity Recognition Based on Hybrid Multistage CNN-RNN Learner
    Phan, Robert
    Luu, Thoai Man
    Davey, Rachel
    Chetty, Girija
    2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND DATA ENGINEERING (ICMLDE 2018), 2018, : 128 - 135
  • [16] Video Emotion Recognition Using Local Enhanced Motion History Image and CNN-RNN Networks
    Wang, Haowen
    Zhou, Guoxiang
    Hu, Min
    Wang, Xiaohua
    BIOMETRIC RECOGNITION, CCBR 2018, 2018, 10996 : 109 - 119
  • [17] The assessment of 3D model representation for retrieval with CNN-RNN networks
    Weizhi Nie
    Kun Wang
    Hongtao Wang
    Yuting Su
    Multimedia Tools and Applications, 2019, 78 : 16979 - 16994
  • [18] Enhanced CNN-RNN deep learning-based framework for the detection of glaucoma
    Veena, H. N.
    Muruganandham, A.
    Kumaran, T. Senthil
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2021, 36 (02) : 133 - 147
  • [19] Detection of Deepfake Media Using a Hybrid CNN-RNN Model and Particle Swarm Optimization (PSO) Algorithm
    Al-Adwan, Aryaf
    Alazzam, Hadeel
    Al-Anbaki, Noor
    Alduweib, Eman
    COMPUTERS, 2024, 13 (04)
  • [20] The assessment of 3D model representation for retrieval with CNN-RNN networks
    Nie, Weizhi
    Wang, Kun
    Wang, Hongtao
    Su, Yuting
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (12) : 16979 - 16994