Semantic representation for visual reasoning

被引：24

作者：

Ni, Xubin ^{[1
]}

Yin, Lirong ^{[2
]}

Chen, Xiaobing ^{[1
]}

Liu, Shan ^{[1
,3
]}

Yang, Bo ^{[1
]}

Zheng, Wenfeng ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Automat, Chengdu 610054, Sichuan, Peoples R China

[2] Univ Iowa, Geog & Sustainabil Sci Dept, Iowa City, IA 52242 USA

[3] Old Dominion Univ, Dept Modelling Simulat & Visualizat Engn, Norfolk, VA 23529 USA

来源：

2018 INTERNATIONAL JOINT CONFERENCE ON METALLURGICAL AND MATERIALS ENGINEERING (JCMME 2018) | 2019年 / 277卷

关键词：

D O I：

10.1051/matecconf/201927702006

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In the field of visual reasoning, image features are widely used as the input of neural networks to get answers. However, image features are too redundant to learn accurate characterizations for regular networks. While in human reasoning, abstract description is usually constructed to avoid irrelevant details. Inspired by this, a higher-level representation named semantic representation is introduced in this paper to make visual reasoning more efficient. The idea of the Gram matrix used in the neural style transfer research is transferred here to build a relation matrix which enables the related information between objects to be better represented. The model using semantic representation as input outperforms the same model using image features as input which verifies that more accurate results can be obtained through the introduction of high-level semantic representation in the field of visual reasoning.

引用

页数：6

共 50 条

[1] Improving Visual Reasoning Through Semantic Representation
Zheng, Wenfeng
Liu, Xiangjun
Ni, Xubin
Yin, Lirong
Yang, Bo
[J]. IEEE ACCESS, 2021, 9 : 91476 - 91486
[2] Person Re-Identification With Visual Semantic Representation Mining and Reasoning
Zhao, Chuang
Shi, Yuxuan
Ling, Hefei
Wang, Qian
Zhao, Chengxin
Chen, Jiazhong
Li, Ping
[J]. IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2023, 5 (04): : 486 - 497
[3] Schema Reasoning and Semantic Representation for Citation Semantic Link Network
Chen, Weiling
Yin, Shiqun
Qiu, Yuhui
[J]. 2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 3, PROCEEDINGS, 2009, : 366 - 369
[4] VISUAL REASONING FOR DESIGN BY ANALOGY: FUSE VISUAL AND SEMANTIC KNOWLEDGE
Zhang, Zijian
Jin, Yan
[J]. PROCEEDINGS OF ASME 2022 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2022, VOL 6, 2022,
[5] Refining Visual Activity Recognition with Semantic Reasoning
Ramoly, Nathan
Vassout, Vincent
Bouzeghoub, Amel
El Yacoubi, Mounim A.
Hariz, Mossaab
[J]. 2017 IEEE 31ST INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2017, : 720 - 727
[6] Multiview Semantic Representation for Visual Recognition
Zhang, Chunjie
Cheng, Jian
Tian, Qi
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (05) : 2038 - 2049
[7] A New Method for Knowledge Representation and Reasoning of Semantic Web
Wang, Jie
Han, X. -P.
Zhang, Y. -Y.
Liu, C. -N.
[J]. 2011 INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND NEURAL COMPUTING (FSNC 2011), VOL IV, 2011, : 34 - 37
[8] A New Method for Knowledge Representation and Reasoning of Semantic Web
Wang, Jie
Han, X. -P.
Zhang, Y. -Y.
Liu, C-N
[J]. 2011 AASRI CONFERENCE ON INFORMATION TECHNOLOGY AND ECONOMIC DEVELOPMENT (AASRI-ITED 2011), VOL 1, 2011, : 34 - 37
[9] Visual Semantic Reasoning for Image-Text Matching
Li, Kunpeng
Zhang, Yulun
Li, Kai
Li, Yuanyuan
Fu, Yun
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4653 - 4661
[10] Semantic-aware visual scene representation
Mohammad Javad Parseh
Mohammad Rahmanimanesh
Parviz Keshavarzi
Zohreh Azimifar
[J]. International Journal of Multimedia Information Retrieval, 2022, 11 : 619 - 638

← 1 2 3 4 5 →