Towards Visual Words to Words Text Detection with a General Bag of Words Representation

被引:0
|
作者
Mehta, Rakesh [1 ]
Chum, Ondrej [2 ]
Matas, Jiri [2 ]
机构
[1] Tampere Univ Technol, Dept Signal Proc, FIN-33101 Tampere, Finland
[2] Czech Tech Univ, Fac Elect Engn, Ctr Machine Pecept, Dept Cybernet, Prague, Czech Republic
关键词
IMAGES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the problem of text localization and retrieval in real world images. We are first to study the retrieval of text images, i.e. the selection of images containing text in large collections at high speed. We propose a novel representation, textual visual words, which describe text by generic visual words that geometrically consistently predict bottom and top lines of text. The visual words are discretized SIFT descriptors of Hessian features. The features may correspond to various structures present in the text - character fragments, individual characters or their arrangements. The textual words representation is invariant to affine transformation of the image and local linear change of intensity. Experiments demonstrate that the proposed method outperforms the state-of-the-art on the MS dataset. The proposed method detects blurry, small font, low contrast, noisy text from real world images.
引用
收藏
页码:641 / 645
页数:5
相关论文
共 50 条
  • [31] Weighted Bag of Visual Words with enhanced deep features for melanoma detection
    Okur, Erdem
    Turkan, Mehmet
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [32] Transportation Object Detection with Bag of Visual Words Model by PLSA and MLP
    Hyun Chul Song
    Kwang Nam Choi
    Mobile Networks and Applications, 2018, 23 : 1103 - 1110
  • [33] Semantic Bag-of-Words Models for Visual Concept Detection and Annotation
    Zhang, Yu
    Bres, Stphane
    Chen, Liming
    8TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS 2012), 2012, : 289 - 295
  • [34] Transportation Object Detection with Bag of Visual Words Model by PLSA and MLP
    Song, Hyun Chul
    Choi, Kwang Nam
    MOBILE NETWORKS & APPLICATIONS, 2018, 23 (04): : 1103 - 1110
  • [35] VISUAL VOICE ACTIVITY DETECTION BASED ON SPATIOTEMPORAL INFORMATION AND BAG OF WORDS
    Patrona, Foteini
    Iosifidis, Alexandros
    Tefas, Anastasios
    Nikolaidis, Nikolaos
    Pitas, Ioannis
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 2334 - 2338
  • [36] A New Text Representation Scheme Combining Bag-of-Words and Bag-of-Concepts Approaches for Automatic Text Classification
    Alahmadi, Alaa
    Joorabchi, Arash
    Mahdi, Abdulhussain E.
    2013 7TH IEEE GCC CONFERENCE AND EXHIBITION (GCC), 2013, : 108 - 113
  • [37] Fuzzy Bag-of-Words Model for Document Representation
    Zhao, Rui
    Mao, Kezhi
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 26 (02) : 794 - 804
  • [38] The locally weighted bag of words framework for document representation
    Lebanon, Guy
    Mao, Yi
    Dillon, Joshua
    Journal of Machine Learning Research, 2007, 8 : 2405 - 2441
  • [39] The locally weighted bag of words framework for document representation
    Lebanon, Guy
    Mao, Yi
    Dillon, Joshua
    JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 2405 - 2441
  • [40] WORDS ABOUT WORDS ABOUT WORDS - THEORY, CRITICISM, AND THE LITERARY TEXT
    KRIEGER, M
    ACADEME-BULLETIN OF THE AAUP, 1984, 70 (01): : 17 - 24