Knowledge is power: Open-world knowledge representation learning for knowledge-based visual reasoning ☆,☆☆ , ☆☆

被引:1
|
作者
Zheng, Wenbo [1 ]
Yan, Lan [2 ]
Wang, Fei-Yue [3 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China
[2] Hunan Univ, Coll Comp Sci & Engn, Changsha 410082, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
基金
海南省自然科学基金; 国家重点研发计划;
关键词
Visual reasoning; Knowledge representation learning; Open-world learning; Graph model; LANGUAGE; VISION;
D O I
10.1016/j.artint.2024.104147
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge-based visual reasoning requires the ability to associate outside knowledge that is not present in a given image for cross-modal visual understanding. Two deficiencies of the existing approaches are that (1) they only employ or construct elementary and explicit but superficial knowledge graphs while lacking complex and implicit but indispensable cross-modal knowledge for visual reasoning, and (2) they also cannot reason new/ unseen images or questions in open environments and are often violated in real-world applications. How to represent and leverage tacit multimodal knowledge for open-world visual reasoning scenarios has been less studied. In this paper, we propose a novel open-world knowledge representation learning method to not only construct implicit knowledge representations from the given images and their questions but also enable knowledge transfer from a known given scene to an unknown scene for answer prediction. Extensive experiments conducted on six benchmarks demonstrate the superiority of our approach over other state-of-the-art methods. We apply our approach to other visual reasoning tasks, and the experimental results show that our approach, with its good performance, can support related reasoning applications.
引用
收藏
页数:26
相关论文
共 50 条
  • [41] Prior Knowledge-Based Probabilistic Collaborative Representation for Visual Recognition
    Lan, Rushi
    Zhou, Yicong
    Liu, Zhenbing
    Luo, Xiaonan
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (04) : 1498 - 1508
  • [42] The effect of knowledge representation schemes on maintainability of knowledge-based systems
    Lee, SR
    OKeefe, RM
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (01) : 173 - 178
  • [43] Curriculum knowledge representation and manipulation in knowledge-based tutoring systems
    Zhou, G
    Wang, JTL
    Ng, PA
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (05) : 679 - 689
  • [44] PRACTICAL, OBJECT-BASED KNOWLEDGE REPRESENTATION FOR KNOWLEDGE-BASED SYSTEMS
    PATELSCHNEIDER, PF
    INFORMATION SYSTEMS, 1990, 15 (01) : 9 - 19
  • [45] KNOWLEDGE REPRESENTATION AND REASONING
    LEVESQUE, HJ
    ANNUAL REVIEW OF COMPUTER SCIENCE, 1986, 1 : 255 - 287
  • [46] Temporal Knowledge Graph Reasoning Based on Evolutional Representation Learning
    Li, Zixuan
    Jin, Xiaolong
    Li, Wei
    Guan, Saiping
    Guo, Jiafeng
    Shen, Huawei
    Wang, Yuanzhuo
    Cheng, Xueqi
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 408 - 417
  • [47] Knowledge-based representation of fuzzy sets
    Intan, R
    Mukaidono, M
    Emoto, M
    PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOL 1 & 2, 2002, : 590 - 595
  • [48] The Core of Smart Cities: Knowledge Representation and Descriptive Framework Construction in Knowledge-Based Visual Question Answering
    Wang, Ruiping
    Wu, Shihong
    Wang, Xiaoping
    SUSTAINABILITY, 2022, 14 (20)
  • [49] Rethinking Knowledge Graph Evaluation Under the Open-World Assumption
    Yang, Haotong
    Lin, Zhouchen
    Zhang, Muhan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [50] REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
    Lin, Yuanze
    Xie, Yujia
    Chen, Dongdong
    Xu, Yichong
    Zhu, Chenguang
    Yuan, Lu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,