Multiview Semantic Representation for Visual Recognition

被引:3
|
作者
Zhang, Chunjie [1 ,2 ]
Cheng, Jian [2 ,3 ,4 ]
Tian, Qi [5 ]
机构
[1] Chinese Acad Sci, Res Ctr Brain Inspired Intelligence, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[4] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing 100190, Peoples R China
[5] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
基金
美国国家科学基金会;
关键词
Image classification; multiview; object categorization; semantic representation; visual recognition; IMAGE CLASSIFICATION; LOW-RANK; OBJECT CATEGORIZATION; FUSION; FACE;
D O I
10.1109/TCYB.2018.2875728
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to interclass and intraclass variations, the images of different classes are often cluttered which makes it hard for efficient classifications. The use of discriminative classification algorithms helps to alleviate this problem. However, it is still an open problem to accurately model the relationships between visual representations and human perception. To alleviate these problems, in this paper, we propose a novel multiview semantic representation (MVSR) algorithm for efficient visual recognition. First, we leverage visually based methods to get initial image representations. We then use both visual and semantic similarities to divide images into groups which are then used for semantic representations. We treat different image representation strategies, partition methods, and numbers as different views. A graph is then used to combine the discriminative power of different views. The similarities between images can be obtained by measuring the similarities of graphs. Finally, we train classifiers to predict the categories of images. We evaluate the discriminative power of the proposed MVSR method for visual recognition on several public image datasets. Experimental results show the effectiveness of the proposed method.
引用
收藏
页码:2038 / 2049
页数:12
相关论文
共 50 条
  • [1] A Semantic Higher-Level Visual Representation for Object Recognition
    El Sayad, Ismail
    Martinet, Jean
    Urruty, Thierry
    Dejraba, Chabane
    [J]. ADVANCES IN MULTIMEDIA MODELING, PT I, 2011, 6523 : 251 - 261
  • [2] Semantic Descriptors into Representation for Robust Indoor Visual Place Recognition
    Kim, Nuni
    Kang, Minjae
    Oh, Songhwai
    [J]. 2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 715 - 718
  • [3] Integrating Spatio-Temporal Context with Multiview Representation for Object Recognition in Visual Surveillance
    Liu, Xiaobai
    Lin, Liang
    Yan, Shuicheng
    Jin, Hai
    Tao, Wenbing
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (04) : 393 - 407
  • [4] Multiview Representation Learning for Human Activity Recognition
    Hamidi, Massinissa
    Osmani, Aomar
    Rasaq, Lukmon
    Dogan, Gulustan
    Alotaibi, Nouran
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND VIRTUAL ENVIRONMENTS FOR MEASUREMENT SYSTEMS AND APPLICATIONS (IEEE CIVEMSA 2022), 2022,
  • [5] Boosted random contextual semantic space based representation for visual recognition
    Zhang, Chunjie
    Xue, Zhe
    Zhu, Xiaobin
    Wang, Huanian
    Huang, Qingming
    Tian, Qi
    [J]. INFORMATION SCIENCES, 2016, 369 : 160 - 170
  • [6] Semantic representation for visual reasoning
    Ni, Xubin
    Yin, Lirong
    Chen, Xiaobing
    Liu, Shan
    Yang, Bo
    Zheng, Wenfeng
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON METALLURGICAL AND MATERIALS ENGINEERING (JCMME 2018), 2019, 277
  • [7] Levels of representation in visual word recognition: A dissociation between morphological and semantic processing
    Stolz, JA
    Besner, D
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1998, 24 (06) : 1642 - 1655
  • [8] Time Distributed Multiview Representation for Speech Emotion Recognition
    de Mattos, Flavia Leticia
    Pellenz, Marcelo E.
    Britto, Jr. Alceu de S.
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I, 2024, 14469 : 148 - 162
  • [9] A Sensorimotor Perspective on Contrastive Multiview Visual Representation Learning
    Laflaquiere, Alban
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (02) : 269 - 278
  • [10] A Multiview Representation Framework for Micro-Expression Recognition
    Huang, Tianhuan
    Chen, Lei
    Feng, Yuncong
    Ben, Xianye
    Xiao, Ruixue
    Xue, Tianle
    [J]. IEEE ACCESS, 2019, 7 : 120670 - 120680