Scene Text Script Identification with Convolutional Recurrent Neural Networks

Cited by: 0
Authors
Mei, Jieru [1]
Dai, Luo [2]
Shi, Baoguang [2]
Bai, Xiang [2]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Automat, Wuhan 430074, Hubei, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Hubei, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
FEATURES;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Script identification for scene text images is a challenging task. This paper describes a novel deep neural network structure that efficiently identifies the scripts of text in images. Our design exploits two important factors: the image representation and the spatial dependencies within text lines. To this end, we combine a Convolutional Neural Network (CNN) and a Recurrent Neural Network (RNN) into one end-to-end trainable network. The former generates rich image representations, while the latter effectively analyzes long-term spatial dependencies. In addition, we adopt an average pooling structure on top of the network in order to handle input images of arbitrary size. Experiments on several datasets, including SIW-13 and CVSI2015, demonstrate that our approach outperforms previous approaches.
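The pipeline outlined in the abstract (convolutional features, a recurrent pass along the text line, then average pooling so that any image width yields a fixed-size output) can be illustrated with a minimal sketch. This is not the authors' implementation: it uses only the Python standard library, random untrained weights, a plain tanh recurrence in place of whatever recurrent unit the paper uses, and hypothetical names (`init_params`, `conv_features`, `identify_script`) and sizes (13 classes, echoing the SIW-13 dataset name) chosen for illustration.

```python
import math
import random

def make_matrix(rows, cols, rng):
    # Random weights; a real model would learn these end to end.
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

def matvec(m, v):
    return [sum(w * x for w, x in zip(row, v)) for row in m]

def conv_features(image, kernels, width):
    """CNN stand-in: slide full-height kernels over column windows, then ReLU."""
    height, total_w = len(image), len(image[0])
    seq = []
    for start in range(total_w - width + 1):
        window = [image[r][c] for r in range(height)
                  for c in range(start, start + width)]
        seq.append([max(0.0, v) for v in matvec(kernels, window)])
    return seq  # one feature vector per horizontal position

def rnn(seq, w_in, w_rec):
    """Plain tanh recurrence, left to right along the text line."""
    h = [0.0] * len(w_rec)
    states = []
    for x in seq:
        a, b = matvec(w_in, x), matvec(w_rec, h)
        h = [math.tanh(p + q) for p, q in zip(a, b)]
        states.append(h)
    return states

def average_pool(states):
    # Averaging over positions removes the dependence on image width.
    n = len(states)
    return [sum(col) / n for col in zip(*states)]

def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def init_params(height, width=3, channels=4, hidden=5, classes=13, seed=0):
    # All sizes here are illustrative assumptions, not values from the paper.
    rng = random.Random(seed)
    return {
        "width": width,
        "kernels": make_matrix(channels, height * width, rng),
        "w_in": make_matrix(hidden, channels, rng),
        "w_rec": make_matrix(hidden, hidden, rng),
        "w_out": make_matrix(classes, hidden, rng),
    }

def identify_script(image, params):
    seq = conv_features(image, params["kernels"], params["width"])
    pooled = average_pool(rnn(seq, params["w_in"], params["w_rec"]))
    return softmax(matvec(params["w_out"], pooled))
```

Calling `identify_script` on two images of the same height but different widths returns a probability vector of the same length in both cases, which is the property the average pooling step is meant to provide.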
Pages: 4053-4058
Number of pages: 6
Related papers
50 in total
  • [41] DIRECTION FINDING USING CONVOLUTIONAL NEURAL NETWORKS and CONVOLUTIONAL RECURRENT NEURAL NETWORKS
    Uckun, Fehmi Ayberk
    Ozer, Hakan
    Nurbas, Ekin
    Onat, Emrah
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [42] Text detection with convolutional neural networks
    Delakis, Manolis
    Garcia, Christophe
    VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2008, : 290 - 294
  • [43] Script Identification from Camera-Captured Multi-script Scene Text Components
    Jajoo, Madhuram
    Chakraborty, Neelotpal
    Mollah, Ayatullah Faruk
    Basu, Subhadip
    Sarkar, Ram
    RECENT DEVELOPMENTS IN MACHINE LEARNING AND DATA ANALYTICS, 2019, 740 : 159 - 166
  • [44] Convolutional Recurrent Neural Networks for Observation-Centered Plant Identification
    Liu, Xuanxin
    Xu, Fu
    Sun, Yu
    Zhang, Haiyan
    Chen, Zhibo
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2018, 2018 (2018)
  • [45] A Hybrid Scene Text Script Identification Network for Regional Indian Languages
    Naosekpam, Veronica
    Sahu, Nilkanta
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (08)
  • [46] Convolutional Neural Networks with Recurrent Neural Filters
    Yang, Yi
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 912 - 917
  • [47] Text detection, recognition, and script identification in natural scene images: a Review
    Veronica Naosekpam
    Nilkanta Sahu
    International Journal of Multimedia Information Retrieval, 2022, 11 : 291 - 314
  • [48] Aerial Scene Classification with Convolutional Neural Networks
    Jia, Sibo
    Liu, Huaping
    Sun, Fuchun
    ADVANCES IN NEURAL NETWORKS - ISNN 2015, 2015, 9377 : 258 - 265
  • [49] Text detection, recognition, and script identification in natural scene images: a Review
    Naosekpam, Veronica
    Sahu, Nilkanta
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (03) : 291 - 314
  • [50] Scene Disparity Estimation with Convolutional Neural Networks
    Anas, Essa R.
    Guo, Li
    Onsy, Ahmed
    Matuszewski, Bogdan J.
    MULTIMODAL SENSING: TECHNOLOGIES AND APPLICATIONS, 2019, 11059