Scene Text Script Identification with Convolutional Recurrent Neural Networks

被引:0
|
作者
Mei, Jieru [1 ]
Dai, Luo [2 ]
Shi, Baoguang [2 ]
Bai, Xiang [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Automat, Wuhan 430074, Hubei, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
FEATURES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Script identification for scene text images is a challenging task. This paper describes a novel deep neural network structure that efficiently identifies scripts of images. In our design, we exploit two important factors, namely the image representation, and the spatial dependencies within text lines. To this end, we bring together a Convolutional Neural Network (CNN) and a Recurrent Neural Network (RNN) into one end-to-end trainable network. The former generates rich image representations, while the latter effectively analyzes long-term spatial dependencies. Besides, on top of the structure, we adopt an average pooling structure in order to deal with input images of arbitrary sizes. Experiments on several datasets, including SIW-13 and CVSI2015, demonstrate that our approach achieves superior performance, compared with previous approaches.
引用
下载
收藏
页码:4053 / 4058
页数:6
相关论文
共 50 条
  • [31] Text Detection and Recognition for Natural Scene Images Using Deep Convolutional Neural Networks
    Wu, Xianyu
    Luo, Chao
    Zhang, Qian
    Zhou, Jiliu
    Yang, Hao
    Li, Yulian
    CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 61 (01): : 289 - 300
  • [32] Identification of handwritten Gujarati alphanumeric script by integrating transfer learning and convolutional neural networks
    Limbachiya, Krishn
    Sharma, Ankit
    Thakkar, Priyank
    Adhyaru, Dipak
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2022, 47 (02):
  • [33] Identification of handwritten Gujarati alphanumeric script by integrating transfer learning and convolutional neural networks
    Krishn Limbachiya
    Ankit Sharma
    Priyank Thakkar
    Dipak Adhyaru
    Sādhanā, 2022, 47
  • [34] Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks
    Li, Hui
    Wang, Peng
    Shen, Chunhua
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5248 - 5256
  • [35] On the improvement of handwritten text line recognition with octave convolutional recurrent neural networks
    Castro, Dayvid
    Zanchettin, Cleber
    Amaral, Luis A. Nunes
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2024, 27 (4) : 567 - 581
  • [36] Text-independent writer identification using convolutional neural networks
    Nguyen, Hung Tuan
    Nguyen, Cuong Tuan
    Ino, Takeya
    Indurkhya, Bipin
    Nakagawa, Masaki
    arXiv, 2020,
  • [37] Text-Attentional Convolutional Neural Network for Scene Text Detection
    He, Tong
    Huang, Weilin
    Qiao, Yu
    Yao, Jian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (06) : 2529 - 2541
  • [38] Convolutional Neural Networks for Text Hashing
    Xu, Jiaming
    Wang, Peng
    Tian, Guanhua
    Xu, Bo
    Zhao, Jun
    Wang, Fangyuan
    Hao, Hongwei
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 1369 - 1375
  • [39] Text normalization with convolutional neural networks
    Yolchuyeva, Sevinj
    Nemeth, Geza
    Gyires-Toth, Balint
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (03) : 589 - 600
  • [40] Scene Text Localization Using Lightweight Convolutional Networks
    Lorgus Decker, Luis Gustavo
    Pinto, Allan
    Flores Campana, Jose Luis
    Neira, Manuel Cordova
    dos Santos, Andreza Aparecida
    de Jesus Conceicao, Jhonatas Santos
    Pedrini, Helio
    Angeloni, Marcus de Assis
    Li, Lin Tzy
    Luvizon, Diogo Carbonera
    Torres, Ricardo da S.
    COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VISIGRAPP 2020, 2022, 1474 : 297 - 318