Knowledge is power: Open-world knowledge representation learning for knowledge-based visual reasoning

Cited by: 1
Authors
Zheng, Wenbo [1]
Yan, Lan [2]
Wang, Fei-Yue [3]
Affiliations
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China
[2] Hunan Univ, Coll Comp Sci & Engn, Changsha 410082, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
Funding
Hainan Provincial Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
Visual reasoning; Knowledge representation learning; Open-world learning; Graph model; LANGUAGE; VISION;
DOI
10.1016/j.artint.2024.104147
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Knowledge-based visual reasoning requires the ability to associate outside knowledge that is not present in a given image for cross-modal visual understanding. Existing approaches have two deficiencies: (1) they only employ or construct elementary, explicit, but superficial knowledge graphs, and lack the complex, implicit, but indispensable cross-modal knowledge needed for visual reasoning; and (2) they cannot reason about new or unseen images or questions in open environments, so the assumptions they rely on are often violated in real-world applications. How to represent and leverage tacit multimodal knowledge in open-world visual reasoning scenarios has been less studied. In this paper, we propose a novel open-world knowledge representation learning method that not only constructs implicit knowledge representations from the given images and their questions but also enables knowledge transfer from a known scene to an unknown scene for answer prediction. Extensive experiments conducted on six benchmarks demonstrate the superiority of our approach over other state-of-the-art methods. We also apply our approach to other visual reasoning tasks, and the experimental results show that it performs well and can support related reasoning applications.
Pages: 26
Related papers
50 in total
  • [21] Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning
    Chen, Zhenfang
    Zhou, Qinhong
    Shen, Yikang
    Hong, Yining
    Sun, Zhiqing
    Gutfreund, Dan
    Gan, Chuang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1254 - 1262
  • [22] A KNOWLEDGE REPRESENTATION MODEL FOR MULTIUSER KNOWLEDGE-BASED SYSTEMS
    BASU, A
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1993, 5 (02) : 177 - 189
  • [23] Investigating Domain Knowledge Graph Knowledge Reasoning and Assessing Quality Using Knowledge Representation Learning and Knowledge Reasoning Algorithms
    Cao, Ying
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2025, 24 (01)
  • [24] Mathematics teacher’s knowledge, knowledge-based reasoning, and contexts
    Salvador Llinares
    Journal of Mathematics Teacher Education, 2018, 21 : 1 - 3
  • [25] Mathematics teacher's knowledge, knowledge-based reasoning, and contexts
    Llinares, Salvador
    JOURNAL OF MATHEMATICS TEACHER EDUCATION, 2018, 21 (01) : 1 - 3
  • [26] SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge
    Wang, Andong
    Wu, Bo
    Chen, Sunli
    Chen, Zhenfang
    Guan, Haotian
    Lee, Wei-Ning
    Li, Li Erran
    Gan, Chuang
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 13384 - 13394
  • [27] The joint knowledge reasoning model based on knowledge representation learning for aviation assembly domain
    Liu, PeiFeng
    Qian, Lu
    Lu, Hu
    Xue, Lei
    Zhao, XingWei
    Tao, Bo
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2024, 67 (01) : 143 - 156
  • [28] The joint knowledge reasoning model based on knowledge representation learning for aviation assembly domain
    LIU PeiFeng
    QIAN Lu
    LU Hu
    XUE Lei
    ZHAO XingWei
    TAO Bo
    Science China(Technological Sciences), 2024, 67 (01) : 143 - 156
  • [29] The joint knowledge reasoning model based on knowledge representation learning for aviation assembly domain
    PeiFeng Liu
    Lu Qian
    Hu Lu
    Lei Xue
    XingWei Zhao
    Bo Tao
    Science China Technological Sciences, 2024, 67 : 143 - 156
  • [30] Distributed representations of entities in open-world knowledge graphs
    Guo, Lingbing
    Chen, Zhuo
    Chen, Jiaoyan
    Zhang, Yichi
    Sun, Zequn
    Bo, Zhongpu
    Fang, Yin
    Liu, Xiaoze
    Chen, Huajun
    Zhang, Wen
    KNOWLEDGE-BASED SYSTEMS, 2024, 290