Bag of Local Convolutional Triplets for Script Identification in Scene Text

被引:8
|
作者
Zdenek, Jan [1 ]
Nakayama, Hideki [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
关键词
script identification; scene text; convolutional neural networks; bag-of-visual words;
D O I
10.1109/ICDAR.2017.68
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increasing interest in scene text reading in multilingual environments raises the need to recognize and distinguish between different writing systems. In this paper, we propose a novel method for script identification in scene text using triplets of local convolutional features in combination with the traditional bag-of-visual-words model. Feature triplets are created by making combinations of descriptors extracted from local patches of the input images using a convolutional neural network. This approach allows us to generate a more descriptive codeword dictionary for the bag-of-visual-words model, as the low discriminative power of weak descriptors is enhanced by other descriptors in a triplet. The proposed method is evaluated on two public benchmark datasets for scene text script identification and a public dataset for script identification in video captions. The experiments demonstrate that our method outperforms the baseline and yields competitive results on all three datasets.
引用
下载
收藏
页码:369 / 375
页数:7
相关论文
共 50 条
  • [31] Reading Scene Text with Aggregated Temporal Convolutional Encoder
    Ma, Tianlong
    Du, Xiangcheng
    Wu, Xingjiao
    Zhou, Zhao
    Zheng, Yingbin
    Jin, Cheng
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (11)
  • [32] RECURRENT GLOBAL CONVOLUTIONAL NETWORK FOR SCENE TEXT DETECTION
    Mohanty, Sabyasachi
    Dutta, Tanima
    Gupta, Hari Prabhat
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2750 - 2754
  • [33] Scene text detection with fully convolutional neural networks
    Zhandong Liu
    Wengang Zhou
    Houqiang Li
    Multimedia Tools and Applications, 2019, 78 : 18205 - 18227
  • [34] Reading scene text with fully convolutional sequence modeling
    Gao, Yunze
    Chen, Yingying
    Wang, Jinqiao
    Tang, Ming
    Lu, Hanqing
    NEUROCOMPUTING, 2019, 339 : 161 - 170
  • [35] Scene text detection with fully convolutional neural networks
    Liu, Zhandong
    Zhou, Wengang
    Li, Houqiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (13) : 18205 - 18227
  • [36] SCENE TEXT RECOGNITION WITH DEEPER CONVOLUTIONAL NEURAL NETWORKS
    Zhang, Yuqi
    Wang, Wei
    Wang, Liang
    Wang, Liuan
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 2384 - 2388
  • [37] Scene Text Localization Using Lightweight Convolutional Networks
    Lorgus Decker, Luis Gustavo
    Pinto, Allan
    Flores Campana, Jose Luis
    Neira, Manuel Cordova
    dos Santos, Andreza Aparecida
    de Jesus Conceicao, Jhonatas Santos
    Pedrini, Helio
    Angeloni, Marcus de Assis
    Li, Lin Tzy
    Luvizon, Diogo Carbonera
    Torres, Ricardo da S.
    COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VISIGRAPP 2020, 2022, 1474 : 297 - 318
  • [38] Remote Sensing Image Scene Classification Using Bag of Convolutional Features
    Cheng, Gong
    Li, Zhenpeng
    Yao, Xiwen
    Guo, Lei
    Wei, Zhongliang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (10) : 1735 - 1739
  • [39] Mining discriminative patches for script identification in natural scene images
    Lu, Liqiong
    Wu, Dong
    Tang, Ziwei
    Yi, Yaohua
    Huang, Faliang
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (01) : 551 - 563
  • [40] Word-Level Script Identification from Scene Images
    Fasil, O. K.
    Manjunath, S.
    Aradhya, V. N. Manjunath
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON FRONTIERS IN INTELLIGENT COMPUTING: THEORY AND APPLICATIONS, (FICTA 2016), VOL 2, 2017, 516 : 417 - 426