Unconstrained Scene Text and Video Text Recognition for Arabic Script

被引:0
|
作者
Jain, Mohit [1 ]
Mathew, Minesh [1 ]
Jawahar, C. V. [1 ]
机构
[1] IIIT Hyderabad, Ctr Visual Informat Technol, Hyderabad, Andhra Pradesh, India
关键词
Arabic; Arabic Scene Text; Arabic Video Text; Synthetic Data; Deep Learning; Text Recognition; CHARACTER-RECOGNITION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Building robust recognizers for Arabic has always been challenging. We demonstrate the effectiveness of an end-to-end trainable CNN-RNN hybrid architecture in recognizing Arabic text in videos and natural scenes. We outperform previous state-of-the-art on two publicly available video text datasets - ALIF and ACTIV. For the scene text recognition task, we introduce a new Arabic scene text dataset and establish baseline results. For scripts like Arabic, a major challenge in developing robust recognizers is the lack of large quantity of annotated data. We overcome this by synthesizing millions of Arabic text images from a large vocabulary of Arabic words and phrases. Our implementation is built on top of the model introduced here [37] which is proven quite effective for English scene text recognition. The model follows a segmentation-free, sequence to sequence transcription approach. The network transcribes a sequence of convolutional features from the input image to a sequence of target labels. This does away with the need for segmenting input image into constituent characters/glyphs, which is often difficult for Arabic script. Further, the ability of RNNs to model contextual dependencies yields superior recognition results.
引用
收藏
页码:26 / 30
页数:5
相关论文
共 50 条
  • [31] Text-Level Contrastive Learning for Scene Text Recognition
    Zhuang, Junbin
    Ren, Yixuan
    Li, Xia
    Liang, Zhanpeng
    [J]. 2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 231 - 236
  • [32] SCENE TEXT RECOGNITION IN MULTIPLE FRAMES BASED ON TEXT TRACKING
    Rong, Xuejian
    Yi, Chucai
    Yang, Xiaodong
    Tian, Yingli
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2014,
  • [33] Segmentation of Arabic Text into Characters for Recognition
    Shaikh, Noor Ahmed
    Shaikh, Zubair Ahmed
    Ali, Ghulam
    [J]. WIRELESS NETWORKS, INFORMATION PROCESSING AND SYSTEMS, 2008, 20 : 11 - +
  • [34] Offline arabic text recognition system
    Sarfraz, M
    Nawaz, SN
    Al-Khuraidly, A
    [J]. 2003 INTERNATIONAL CONFERENCE ON GEOMETRIC MODELING AND GRAPHICS, PROCEEDINGS, 2003, : 30 - 35
  • [35] Unconstrained licence plate and text localization and recognition
    Matas, J
    Zimmermann, K
    [J]. 2005 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2005, : 572 - 577
  • [36] Arabic and Latin Scene Text Recognition by Combining Handcrafted and Deep-Learned Features
    Tounsi, Maroua
    Moalla, Ikram
    Pal, Umapada
    Alimi, Adel M.
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) : 9727 - 9740
  • [37] Text Recognition for Nepalese Manuscripts in Pracalit Script
    O'neill, Alexander James
    Hill, Nathan
    [J]. JOURNAL OF OPEN HUMANITIES DATA, 2022, 8
  • [38] Arabic and Latin Scene Text Recognition by Combining Handcrafted and Deep-Learned Features
    Maroua Tounsi
    Ikram Moalla
    Umapada Pal
    Adel M. Alimi
    [J]. Arabian Journal for Science and Engineering, 2022, 47 : 9727 - 9740
  • [39] Automatic text segmentation and text recognition for video indexing
    Lienhart, R
    Effelsberg, W
    [J]. MULTIMEDIA SYSTEMS, 2000, 8 (01) : 69 - 81
  • [40] Open Datasets and Tools for Arabic Text Detection and Recognition in News Video Frames
    Zayene, Oussama
    Touj, Sameh Masmoudi
    Hennebert, Jean
    Ingold, Rolf
    Ben Amara, Najoua Essoukri
    [J]. JOURNAL OF IMAGING, 2018, 4 (02)