Knowledge is power: Open-world knowledge representation learning for knowledge-based visual reasoning

Citations: 1
Authors
Zheng, Wenbo [1]
Yan, Lan [2]
Wang, Fei-Yue [3]
Affiliations
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China
[2] Hunan Univ, Coll Comp Sci & Engn, Changsha 410082, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
Funding
Natural Science Foundation of Hainan Province; National Key Research and Development Program of China;
Keywords
Visual reasoning; Knowledge representation learning; Open-world learning; Graph model; LANGUAGE; VISION;
DOI
10.1016/j.artint.2024.104147
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Knowledge-based visual reasoning requires associating outside knowledge that is not present in a given image to achieve cross-modal visual understanding. Existing approaches have two deficiencies: (1) they employ or construct only elementary, explicit, but superficial knowledge graphs, and lack the complex, implicit, yet indispensable cross-modal knowledge needed for visual reasoning; and (2) they cannot reason about new or unseen images or questions in open environments, so they often break down in real-world applications. How to represent and leverage tacit multimodal knowledge in open-world visual reasoning scenarios has received little study. In this paper, we propose a novel open-world knowledge representation learning method that not only constructs implicit knowledge representations from the given images and their questions but also enables knowledge transfer from a known scene to an unknown scene for answer prediction. Extensive experiments on six benchmarks demonstrate the superiority of our approach over other state-of-the-art methods. We also apply our approach to other visual reasoning tasks, and the experimental results show that it performs well and can support related reasoning applications.
Pages: 26
Related papers
50 in total
  • [31] An Open-World Extension to Knowledge Graph Completion Models
    Shah, Haseeb
    Villmow, Johannes
    Ulges, Adrian
    Schwanecke, Ulrich
    Shafait, Faisal
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3044 - 3051
  • [32] Analogical reasoning in knowledge-based systems
    Zhou, HH
    CRITICAL TECHNOLOGY: PROCEEDINGS OF THE THIRD WORLD CONGRESS ON EXPERT SYSTEMS, VOLS I AND II, 1996, : 1136 - 1142
  • [33] Evaluating knowledge-based statistical reasoning
    Royalty, J
    PSYCHOLOGICAL REPORTS, 1995, 77 (03) : 1323 - 1327
  • [34] Consequential reasoning in knowledge-based systems
    Hudson, DL
    Cohen, ME
    COMPUTERS AND THEIR APPLICATIONS, 2001, : 342 - 345
  • [35] Knowledge-based semantic reasoning for creativity
    Jing D.
    Tian Y.
    Zhang C.
    Yang C.
    Yang H.
    International Journal of Performability Engineering, 2020, 16 (05) : 800 - 810
  • [36] GENERIC TASKS IN KNOWLEDGE-BASED REASONING
    THIBAULT, RC
    IEEE EXPERT-INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1987, 2 (02) : 5 - 5
  • [37] A knowledge-based system for prototypical reasoning
    Lieto, Antonio
    Minieri, Andrea
    Piana, Alberto
    Radicioni, Daniele P.
    CONNECTION SCIENCE, 2015, 27 (02) : 137 - 152
  • [38] Promote knowledge mining towards open-world semi-supervised learning
    Zhao, Tianhao
    Lin, Yutian
    Wu, Yu
    Du, Bo
    PATTERN RECOGNITION, 2024, 149
  • [39] Geometric reasoning for knowledge-based parametric design using graph representation
    Lee, JY
    Kim, K
    COMPUTER-AIDED DESIGN, 1996, 28 (10) : 831 - 841
  • [40] Integrating Image-Based and Knowledge-Based Representation Learning
    Xie, Ruobing
    Heinrich, Stefan
    Liu, Zhiyuan
    Weber, Cornelius
    Yao, Yuan
    Wermter, Stefan
    Sun, Maosong
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2020, 12 (02) : 169 - 178