Bag of Local Convolutional Triplets for Script Identification in Scene Text

Cited by: 8
Authors
Zdenek, Jan [1]
Nakayama, Hideki [1]
Affiliations
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
Keywords
script identification; scene text; convolutional neural networks; bag-of-visual-words
DOI
10.1109/ICDAR.2017.68
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The increasing interest in scene text reading in multilingual environments raises the need to recognize and distinguish between different writing systems. In this paper, we propose a novel method for script identification in scene text using triplets of local convolutional features in combination with the traditional bag-of-visual-words model. Feature triplets are created by making combinations of descriptors extracted from local patches of the input images using a convolutional neural network. This approach allows us to generate a more descriptive codeword dictionary for the bag-of-visual-words model, as the low discriminative power of weak descriptors is enhanced by other descriptors in a triplet. The proposed method is evaluated on two public benchmark datasets for scene text script identification and a public dataset for script identification in video captions. The experiments demonstrate that our method outperforms the baseline and yields competitive results on all three datasets.
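The idea in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes that a triplet is formed by concatenating three local descriptors, that every combination of three patches is used, and that the codebook is already available (in practice it would be learned, for example by k-means). The function names, the toy descriptors, and the toy codebook are all hypothetical.

```python
from itertools import combinations


def make_triplets(descriptors):
    """Concatenate every combination of three local descriptors.

    Concatenation is an assumption; the abstract does not say how
    the descriptors in a triplet are combined.
    """
    return [a + b + c for a, b, c in combinations(descriptors, 3)]


def squared_distance(u, v):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(u, v))


def nearest_codeword(vector, codebook):
    """Index of the codeword closest to the given vector."""
    return min(range(len(codebook)),
               key=lambda k: squared_distance(vector, codebook[k]))


def bow_histogram(descriptors, codebook):
    """L1-normalised histogram of codeword assignments over all triplets."""
    counts = [0] * len(codebook)
    triplets = make_triplets(descriptors)
    for triplet in triplets:
        counts[nearest_codeword(triplet, codebook)] += 1
    total = len(triplets)
    if total == 0:
        return [0.0] * len(codebook)
    return [c / total for c in counts]


# Toy stand-ins for CNN patch descriptors (hypothetical 2-D values).
patches = [[0.0, 0.0], [1.0, 1.0], [0.0, 1.0], [1.0, 0.0]]
# Toy codebook of two 6-D codewords (hypothetical, not learned here).
codebook = [[0.0] * 6, [1.0] * 6]
histogram = bow_histogram(patches, codebook)
```

The resulting histogram is the fixed-length image representation that a bag-of-visual-words pipeline would pass to a classifier. Because each codeword describes three patches jointly, a patch with a weak descriptor is still encoded together with two others.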
Pages: 369-375
Number of pages: 7