Bag of Local Convolutional Triplets for Script Identification in Scene Text

被引:8
|
作者
Zdenek, Jan [1 ]
Nakayama, Hideki [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
关键词
script identification; scene text; convolutional neural networks; bag-of-visual words;
D O I
10.1109/ICDAR.2017.68
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increasing interest in scene text reading in multilingual environments raises the need to recognize and distinguish between different writing systems. In this paper, we propose a novel method for script identification in scene text using triplets of local convolutional features in combination with the traditional bag-of-visual-words model. Feature triplets are created by making combinations of descriptors extracted from local patches of the input images using a convolutional neural network. This approach allows us to generate a more descriptive codeword dictionary for the bag-of-visual-words model, as the low discriminative power of weak descriptors is enhanced by other descriptors in a triplet. The proposed method is evaluated on two public benchmark datasets for scene text script identification and a public dataset for script identification in video captions. The experiments demonstrate that our method outperforms the baseline and yields competitive results on all three datasets.
引用
下载
收藏
页码:369 / 375
页数:7
相关论文
共 50 条
  • [1] Scene Text Script Identification with Convolutional Recurrent Neural Networks
    Mei, Jieru
    Dai, Luo
    Shi, Baoguang
    Bai, Xiang
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 4053 - 4058
  • [2] Adaptive feature fusion for scene text script identification
    Peng, Fuyou
    Ma, Hui
    Liu, Li
    Lu, Yue
    Suen, Ching Y.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (23) : 62677 - 62699
  • [3] A Method of Text Detection and Script Identification in Natural Scene
    Yang, Yaowei
    Ibrahim, Galip
    Zhu, Yali
    Mamat, Hornisa
    Ubul, Kurban
    2022 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY, HUMAN-COMPUTER INTERACTION AND ARTIFICIAL INTELLIGENCE, VRHCIAI, 2022, : 43 - 48
  • [4] A fine-grained approach to scene text script identification
    Gomez, Lluis
    Karatzas, Dimosthenis
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 192 - 197
  • [5] Script Identification from Camera-Captured Multi-script Scene Text Components
    Jajoo, Madhuram
    Chakraborty, Neelotpal
    Mollah, Ayatullah Faruk
    Basu, Subhadip
    Sarkar, Ram
    RECENT DEVELOPMENTS IN MACHINE LEARNING AND DATA ANALYTICS, 2019, 740 : 159 - 166
  • [6] A Hybrid Scene Text Script Identification Network for Regional Indian Languages
    Naosekpam, Veronica
    Sahu, Nilkanta
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (08)
  • [7] Text detection, recognition, and script identification in natural scene images: a Review
    Veronica Naosekpam
    Nilkanta Sahu
    International Journal of Multimedia Information Retrieval, 2022, 11 : 291 - 314
  • [8] Text detection, recognition, and script identification in natural scene images: a Review
    Naosekpam, Veronica
    Sahu, Nilkanta
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (03) : 291 - 314
  • [9] Text detection and script identification in natural scene images using deep learning
    Khalil, Ashwaq
    Jarrah, Moath
    Al-Ayyoub, Mahmoud
    Jararweh, Yaser
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 91
  • [10] Unconstrained Scene Text and Video Text Recognition for Arabic Script
    Jain, Mohit
    Mathew, Minesh
    Jawahar, C. V.
    2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2017, : 26 - 30