Unconstrained OCR for Urdu using Deep CNN-RNN Hybrid Networks

被引:14
|
作者
Jain, Mohit [1 ]
Mathew, Minesh [1 ]
Jawahar, C. V. [1 ]
机构
[1] IIIT Hyderabad, Ctr Visual Informat Technol, Hyderabad, India
关键词
OCR; Urdu OCR; Deep Learning; Urdu Dataset; Text Recognition;
D O I
10.1109/ACPR.2017.5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Building robust text recognition systems for languages with cursive scripts like Urdu has always been challenging. Intricacies of the script and the absence of ample annotated data further act as adversaries to this task. We demonstrate the effectiveness of an end-to-end trainable hybrid CNN-RNN architecture in recognizing Urdu text from printed documents, typically known as Urdu OCR. The solution proposed is not bounded by any language specific lexicon with the model following a segmentation-free, sequence-to-sequence transcription approach. The network transcribes a sequence of convolutional features from an input image to a sequence of target labels. This discards the need to segment the input image into its constituent characters/glyphs, which is often arduous for scripts like Urdu. Furthermore, past and future contexts modelled by bidirectional recurrent layers aids the transcription. We outperform previous state-of-the-art techniques on the synthetic UPTI dataset. Additionally, we publish a new dataset curated by scanning printed Urdu publications in various writing styles and fonts, annotated at the line level. We also provide benchmark results of our model on this dataset.
引用
收藏
页码:747 / 752
页数:6
相关论文
共 50 条
  • [1] Improving CNN-RNN Hybrid Networks for Handwriting Recognition
    Dutta, Kartik
    Krishnan, Praveen
    Mathew, Minesh
    Jawahar, C. V.
    [J]. PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 80 - 85
  • [2] Surgical Tool Segmentation Using A Hybrid Deep CNN-RNN Auto Encoder-Decoder
    Attia, Mohamed
    Hossny, Mohammed
    Nahavandi, Saeid
    Asadi, Hamed
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 3373 - 3378
  • [3] License Plate Localization in Unconstrained Scenes Using a Two-Stage CNN-RNN
    Zhang, Jingjing
    Li, Yuanyuan
    Li, Teng
    Xun, Lina
    Shan, Caifeng
    [J]. IEEE SENSORS JOURNAL, 2019, 19 (13) : 5256 - 5265
  • [4] Enhancing Skin Cancer Diagnosis with Deep Learning: A Hybrid CNN-RNN Approach
    Zareen, Syeda Shamaila
    Sun, Guangmin
    Kundi, Mahwish
    Qadri, Syed Furqan
    Qadri, Salman
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (01): : 1497 - 1519
  • [5] Detection of fake news using deep learning CNN-RNN based methods
    Sastrawan, I. Kadek
    Bayupati, I. P. A.
    Arsa, Dewa Made Sri
    [J]. ICT EXPRESS, 2022, 8 (03): : 396 - 408
  • [6] Video-Based Emotion Recognition using CNN-RNN and C3D Hybrid Networks
    Fan, Yin
    Lu, Xiangju
    Li, Dian
    Liu, Yuanliu
    [J]. ICMI'16: PROCEEDINGS OF THE 18TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2016, : 445 - 450
  • [7] Predicting Beijing Air Quality Using Bayesian Optimized CNN-RNN Hybrid Model
    Tu, Zihan
    Wu, Zhe
    [J]. 2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, : 581 - 587
  • [8] An Integrated Hybrid CNN-RNN Model for Visual Description and Generation of Captions
    Khamparia, Aditya
    Pandey, Babita
    Tiwari, Shrasti
    Gupta, Deepak
    Khanna, Ashish
    Rodrigues, Joel J. P. C.
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (02) : 776 - 788
  • [9] An OpenCL-Based Hybrid CNN-RNN Inference Accelerator On FPGA
    Sun, Yunfei
    Liu, Brian
    Xu, Xianchao
    [J]. 2019 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2019), 2019, : 283 - 286
  • [10] Human Abnormality Classification Using Combined CNN-RNN Approach
    Kabir, Mohsin
    Safir, Farisa Benta
    Shahen, Saifullah
    Maua, Jannatul
    Awlad, Iffat Ara Binte
    Mridha, M. F.
    [J]. 2020 IEEE 17TH INTERNATIONAL CONFERENCE ON SMART COMMUNITIES: IMPROVING QUALITY OF LIFE USING ICT, IOT AND AI (IEEEHONET 2020), 2020, : 204 - 208