Semantic representation for visual reasoning

被引:24
|
作者
Ni, Xubin [1 ]
Yin, Lirong [2 ]
Chen, Xiaobing [1 ]
Liu, Shan [1 ,3 ]
Yang, Bo [1 ]
Zheng, Wenfeng [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Automat, Chengdu 610054, Sichuan, Peoples R China
[2] Univ Iowa, Geog & Sustainabil Sci Dept, Iowa City, IA 52242 USA
[3] Old Dominion Univ, Dept Modelling Simulat & Visualizat Engn, Norfolk, VA 23529 USA
关键词
D O I
10.1051/matecconf/201927702006
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In the field of visual reasoning, image features are widely used as the input of neural networks to get answers. However, image features are too redundant to learn accurate characterizations for regular networks. While in human reasoning, abstract description is usually constructed to avoid irrelevant details. Inspired by this, a higher-level representation named semantic representation is introduced in this paper to make visual reasoning more efficient. The idea of the Gram matrix used in the neural style transfer research is transferred here to build a relation matrix which enables the related information between objects to be better represented. The model using semantic representation as input outperforms the same model using image features as input which verifies that more accurate results can be obtained through the introduction of high-level semantic representation in the field of visual reasoning.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Improving Visual Reasoning Through Semantic Representation
    Zheng, Wenfeng
    Liu, Xiangjun
    Ni, Xubin
    Yin, Lirong
    Yang, Bo
    [J]. IEEE ACCESS, 2021, 9 : 91476 - 91486
  • [2] Person Re-Identification With Visual Semantic Representation Mining and Reasoning
    Zhao, Chuang
    Shi, Yuxuan
    Ling, Hefei
    Wang, Qian
    Zhao, Chengxin
    Chen, Jiazhong
    Li, Ping
    [J]. IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2023, 5 (04): : 486 - 497
  • [3] Schema Reasoning and Semantic Representation for Citation Semantic Link Network
    Chen, Weiling
    Yin, Shiqun
    Qiu, Yuhui
    [J]. 2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 3, PROCEEDINGS, 2009, : 366 - 369
  • [4] VISUAL REASONING FOR DESIGN BY ANALOGY: FUSE VISUAL AND SEMANTIC KNOWLEDGE
    Zhang, Zijian
    Jin, Yan
    [J]. PROCEEDINGS OF ASME 2022 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2022, VOL 6, 2022,
  • [5] Refining Visual Activity Recognition with Semantic Reasoning
    Ramoly, Nathan
    Vassout, Vincent
    Bouzeghoub, Amel
    El Yacoubi, Mounim A.
    Hariz, Mossaab
    [J]. 2017 IEEE 31ST INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2017, : 720 - 727
  • [6] Multiview Semantic Representation for Visual Recognition
    Zhang, Chunjie
    Cheng, Jian
    Tian, Qi
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (05) : 2038 - 2049
  • [7] A New Method for Knowledge Representation and Reasoning of Semantic Web
    Wang, Jie
    Han, X. -P.
    Zhang, Y. -Y.
    Liu, C. -N.
    [J]. 2011 INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND NEURAL COMPUTING (FSNC 2011), VOL IV, 2011, : 34 - 37
  • [8] A New Method for Knowledge Representation and Reasoning of Semantic Web
    Wang, Jie
    Han, X. -P.
    Zhang, Y. -Y.
    Liu, C-N
    [J]. 2011 AASRI CONFERENCE ON INFORMATION TECHNOLOGY AND ECONOMIC DEVELOPMENT (AASRI-ITED 2011), VOL 1, 2011, : 34 - 37
  • [9] Visual Semantic Reasoning for Image-Text Matching
    Li, Kunpeng
    Zhang, Yulun
    Li, Kai
    Li, Yuanyuan
    Fu, Yun
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4653 - 4661
  • [10] Semantic-aware visual scene representation
    Mohammad Javad Parseh
    Mohammad Rahmanimanesh
    Parviz Keshavarzi
    Zohreh Azimifar
    [J]. International Journal of Multimedia Information Retrieval, 2022, 11 : 619 - 638