Bag of Local Convolutional Triplets for Script Identification in Scene Text

被引:8
|
作者
Zdenek, Jan [1 ]
Nakayama, Hideki [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
关键词
script identification; scene text; convolutional neural networks; bag-of-visual words;
D O I
10.1109/ICDAR.2017.68
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increasing interest in scene text reading in multilingual environments raises the need to recognize and distinguish between different writing systems. In this paper, we propose a novel method for script identification in scene text using triplets of local convolutional features in combination with the traditional bag-of-visual-words model. Feature triplets are created by making combinations of descriptors extracted from local patches of the input images using a convolutional neural network. This approach allows us to generate a more descriptive codeword dictionary for the bag-of-visual-words model, as the low discriminative power of weak descriptors is enhanced by other descriptors in a triplet. The proposed method is evaluated on two public benchmark datasets for scene text script identification and a public dataset for script identification in video captions. The experiments demonstrate that our method outperforms the baseline and yields competitive results on all three datasets.
引用
下载
收藏
页码:369 / 375
页数:7
相关论文
共 50 条
  • [21] SCENE TEXT RECOGNITION WITH TEMPORAL CONVOLUTIONAL ENCODER
    Du, Xiangcheng
    Ma, Tianlong
    Zheng, Yingbin
    Ye, Hao
    Wu, Xingjiao
    He, Liang
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2383 - 2387
  • [22] Script identification in natural scene image and video frames using an attention based Convolutional-LSTM network
    Bhunia, Ankan Kumar
    Konwer, Aishik
    Bhunia, Ayan Kumar
    Bhowmick, Abir
    Roy, Partha P.
    Pal, Umapada
    PATTERN RECOGNITION, 2019, 85 : 172 - 184
  • [23] CNN Based Transfer Learning for Scene Script Identification
    Tounsi, Maroua
    Moalla, Ikram
    Lebourgeois, Frank
    Alimi, Adel M.
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT VI, 2017, 10639 : 702 - 711
  • [24] Video Script Identification based on Text Lines
    Trung Quy Phan
    Shivakumara, Palaiahnakote
    Ding, Zhang
    Lu, Shijian
    Tan, Chew Lim
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1240 - 1244
  • [25] Text-Attentional Convolutional Neural Network for Scene Text Detection
    He, Tong
    Huang, Weilin
    Qiao, Yu
    Yao, Jian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (06) : 2529 - 2541
  • [26] A New Lightweight Script Independent Scene Text Style Transfer Network
    Shivakumara, Palaiahnakote
    Roy, Ayush
    Nandanwar, Lokesh
    Pal, Umapada
    Lu, Yue
    Liu, Cheng-Lin
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (13)
  • [27] Multi-script text versus non-text classification of regions in scene images
    Sriman, Bowornrat
    Schomaker, Lambert
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 62 : 23 - 42
  • [28] Local word bag model for text categorization
    Pu, Wen
    Liu, Ning
    Yan, Shuicheng
    Yan, Jun
    Xie, Kunqing
    Chen, Zheng
    ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 625 - +
  • [29] LCSTR: Scene Text Recognition with Large Convolutional Kernels
    Wang, Jiale
    Yang, Lina
    Wang, Jing
    Yang, Haoyan
    Bai, Lin
    Wang, Patrick Shen-Pei
    Li, Xichun
    Lu, Huiwu
    Xu, Huafu
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (01)
  • [30] Random Projected Convolutional Feature for Scene Text Recognition
    Wu, Rui
    Yang, Shuli
    Leng, Dawei
    Luo, Zhenbo
    Wang, Yunhong
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 132 - 137