Scene recognition by semantic visual words

被引:0
|
作者
Elahe Farahzadeh
Tat-Jen Cham
Andrzej Sluzek
机构
[1] Nanyang Technological University,Center of Computational Intelligence, School of Computer Engineering
[2] Nanyang Technological University,Center for Multimedia and Network Technology, School of Computer Engineering
[3] Khalifa University of Science Technology and Research,Department of Electrical and Computer Engineering
来源
关键词
Scene recognition; Semantic vocabulary; Visual words;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we propose a novel approach to introduce semantic relations into the bag-of-words framework. We use the latent semantic models, such as latent semantic analysis (LSA) and probabilistic latent semantic analysis (pLSA), in order to define semantically rich features and embed the visual features into a semantic space. The semantic features used in LSA technique are derived from the low-rank approximation of word–image occurrence matrix by singular value decomposition. Similarly, by using the pLSA approach, the topic-specific distributions of words can be considered dimensions of a concept space. In the proposed space, the distances between words represent the semantic distances which are used for constructing a discriminative and semantically meaningful vocabulary. Position information significantly improves scene recognition accuracy. Inspired by this, in this paper, we bring position information into the proposed semantic vocabulary frameworks. We have tested our approach on the 15-Scene and 67-MIT Indoor datasets and have achieved very promising results.
引用
收藏
页码:1935 / 1944
页数:9
相关论文
共 50 条
  • [1] Scene recognition by semantic visual words
    Farahzadeh, Elahe
    Cham, Tat-Jen
    Sluzek, Andrzej
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2015, 9 (08) : 1935 - 1944
  • [2] Discriminating Semantic Visual Words for Scene Classification
    Liu, Shuoyan
    Xu, De
    Feng, Songhe
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (06) : 1580 - 1588
  • [3] Robustifying Visual Place Recognition with Semantic Scene Categorization
    Arshad, Saba
    Kim, Gon-Woo
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 467 - 469
  • [4] Emotional Semantic Recognition of Visual Scene in Flash Animation
    Shi Lin
    Xu Zhenguo
    Meng Xiangzeng
    [J]. JOURNAL OF CONTROL SCIENCE AND ENGINEERING, 2018, 2018
  • [5] Hierarchical visual-semantic interaction for scene text recognition
    Diao, Liang
    Tang, Xin
    Wang, Jun
    Xie, Guotong
    Hu, Junlin
    [J]. INFORMATION FUSION, 2024, 102
  • [6] Multimodal Visual-Semantic Representations Learning for Scene Text Recognition
    Gao, Xinjian
    Pang, Ye
    Liu, Yuyu
    Han, Maokun
    Yu, Jun
    Wang, Wei
    Chen, Yuanxu
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
  • [7] A Hierarchical Utilization of Semantic Gradients and Scene Structure for Visual Place Recognition
    Bao, Yaoqi
    Pan, Yun
    Yang, Zhe
    Huan, Ruohong
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (02) : 570 - 583
  • [8] Scene Recognition on the Semantic Manifold
    Kwitt, Roland
    Vasconcelos, Nuno
    Rasiwasia, Nikhil
    [J]. COMPUTER VISION - ECCV 2012, PT IV, 2012, 7575 : 359 - 372
  • [9] Visual and semantic ensemble for scene text recognition with gated dual mutual attention
    Liu, Zhiguang
    Wang, Liangwei
    Qiao, Jian
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (04) : 669 - 680
  • [10] Visual and semantic ensemble for scene text recognition with gated dual mutual attention
    Zhiguang Liu
    Liangwei Wang
    Jian Qiao
    [J]. International Journal of Multimedia Information Retrieval, 2022, 11 : 669 - 680