Unconstrained Scene Text and Video Text Recognition for Arabic Script

Cited by: 0
Authors
Jain, Mohit [1]
Mathew, Minesh [1]
Jawahar, C. V. [1]
Affiliations
[1] IIIT Hyderabad, Ctr Visual Informat Technol, Hyderabad, Andhra Pradesh, India
Keywords
Arabic; Arabic Scene Text; Arabic Video Text; Synthetic Data; Deep Learning; Text Recognition; CHARACTER-RECOGNITION
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Building robust recognizers for Arabic has always been challenging. We demonstrate the effectiveness of an end-to-end trainable CNN-RNN hybrid architecture in recognizing Arabic text in videos and natural scenes. We outperform the previous state of the art on two publicly available video text datasets, ALIF and ACTIV. For the scene text recognition task, we introduce a new Arabic scene text dataset and establish baseline results. For scripts like Arabic, a major challenge in developing robust recognizers is the lack of a large quantity of annotated data. We overcome this by synthesizing millions of Arabic text images from a large vocabulary of Arabic words and phrases. Our implementation is built on top of the model introduced in [37], which has proven quite effective for English scene text recognition. The model follows a segmentation-free, sequence-to-sequence transcription approach. The network transcribes a sequence of convolutional features from the input image to a sequence of target labels. This does away with the need for segmenting the input image into constituent characters/glyphs, which is often difficult for Arabic script. Further, the ability of RNNs to model contextual dependencies yields superior recognition results.
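The segmentation-free transcription step described in the abstract can be illustrated with a small sketch. The abstract does not say how per-frame outputs are turned into a label sequence, so this example assumes a CTC-style greedy decoding rule (merge consecutive repeats, then drop a blank symbol), which is common in CNN-RNN recognizers. The function name `greedy_decode`, the `BLANK` index, and the toy scores are hypothetical and are not taken from the paper.

```python
from itertools import groupby

BLANK = 0  # assumed index reserved for the "no label" blank symbol


def greedy_decode(frame_scores, alphabet):
    """Turn per-frame class scores into a label string without segmentation.

    frame_scores: one list of scores per frame; index 0 is the blank,
                  index i (i >= 1) corresponds to alphabet[i - 1].
    """
    # Pick the highest-scoring class index for each frame.
    best = [max(range(len(scores)), key=scores.__getitem__)
            for scores in frame_scores]
    # Merge consecutive repeats of the same index, then drop blanks.
    collapsed = [idx for idx, _ in groupby(best)]
    return "".join(alphabet[idx - 1] for idx in collapsed if idx != BLANK)


# Toy example: two Arabic letters plus the blank, six frames.
alphabet = ["ب", "ا"]
frame_scores = [
    [0.10, 0.80, 0.10],  # first letter
    [0.20, 0.70, 0.10],  # first letter again (merged with previous frame)
    [0.90, 0.05, 0.05],  # blank
    [0.10, 0.10, 0.80],  # second letter
    [0.10, 0.10, 0.80],  # second letter again (merged)
    [0.80, 0.10, 0.10],  # blank
]
decoded = greedy_decode(frame_scores, alphabet)
print(decoded)
```

No frame is ever assigned to a character boundary: the number of frames only needs to be at least the number of output labels, which is why this kind of decoding avoids explicit character/glyph segmentation.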
Pages: 26-30
Number of pages: 5
Related papers
50 in total
  • [1] ARASTI: A Database for Arabic Scene Text Recognition
    Tounsi, Maroua
    Moalla, Ikram
    Alimi, Adel M.
    [J]. 2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2017, : 140 - 144
  • [2] Video Scene Text Frames Categorization for Text Detection and Recognition
    Qin, Longfei
    Shivakumara, Palaiahnakote
    Lu, Tong
    Pal, Umapada
    Tan, Chew Lim
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3886 - 3891
  • [3] Unconstrained Arabic Scene Text Analysis using Concurrent Invariant Points
    Ahmed, Saad Bin
    Naz, Saeeda
    Razzak, Imran
    Prasad, Mukesh
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [4] Detection and Recognition of Arabic Text in Video Frames
    Ohyama, Wataru
    Iwata, Seiya
    Wakabayashi, Tetsushi
    Kimura, Fumitaka
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 7, 2017, : 20 - 24
  • [5] Multi-Script-Oriented Text Detection and Recognition in Video/Scene/Born Digital Images
    Raghunandan, K. S.
    Shivakumara, Palaiahnakote
    Roy, Sangheeta
    Kumar, G. Hemantha
    Pal, Umapada
    Lu, Tong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (04) : 1145 - 1162
  • [6] SARN: Script-Aware Recognition Network for scene multilingual text recognition
    Ke, Wenjun
    Hou, Qingzhi
    Liu, Yutian
    Song, Xinyue
    Wei, Jianguo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 250
  • [7] A Novel Minimal Arabic Script for Preparing Databases and Benchmarks for Arabic Text Recognition Research
    Al-Muhtaseb, Husni A.
    Mahmoud, Sabri A.
    Qahwaji, Rami S.
    [J]. SIGNAL PROCESSING SYSTEMS, 2009, : 37 - +
  • [8] Arabic Cursive Text Recognition from Natural Scene Images
    Bin Ahmed, Saad
    Naz, Saeeda
    Razzak, Muhammad Imran
    Yusof, Rubiyah
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (02):
  • [9] Text detection, recognition, and script identification in natural scene images: a Review
    Veronica Naosekpam
    Nilkanta Sahu
    [J]. International Journal of Multimedia Information Retrieval, 2022, 11 : 291 - 314
  • [10] Text detection, recognition, and script identification in natural scene images: a Review
    Naosekpam, Veronica
    Sahu, Nilkanta
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (03) : 291 - 314