Augmented Commonsense Knowledge for Remote Object Grounding

被引:0
|
作者
Mohammadi, Bahram [1 ]
Hong, Yicong [2 ]
Qi, Yuankai [3 ]
Wu, Qi [1 ]
Pan, Shirui [4 ]
Shi, Javen Qinfeng [1 ]
机构
[1] Univ Adelaide, Australian Inst Machine Learning AIML, Adelaide, SA, Australia
[2] Australian Natl Univ, Canberra, ACT, Australia
[3] Macquarie Univ, Sydney, NSW, Australia
[4] Griffith Univ, Nathan, Qld, Australia
关键词
LANGUAGE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The vision-and-language navigation (VLN) task necessitates an agent to perceive the surroundings, follow natural language instructions, and act in photo-realistic unseen environments. Most of the existing methods employ the entire image or object features to represent navigable viewpoints. However, these representations are insufficient for proper action prediction, especially for the REVERIE task, which uses concise high-level instructions, such as "Bring me the blue cushion in the master bedroom". To address enhancing representation, we propose an augmented commonsense knowledge model (ACK) to leverage commonsense information as a spatio-temporal knowledge graph for improving agent navigation. Specifically, the proposed approach involves constructing a knowledge base by retrieving commonsense information from ConceptNet, followed by a refinement module to remove noisy and irrelevant knowledge. We further present ACK which consists of knowledge graph-aware crossmodal and concept aggregation modules to enhance visual representation and visual-textual data alignment by integrating visible objects, commonsense knowledge, and concept history, which includes object and knowledge temporal information. Moreover, we add a new pipeline for the commonsense-based decision-making process which leads to more accurate local action prediction. Experimental results demonstrate our proposed model noticeably outperforms the baseline and archives the state-of-the-art on the REVERIE benchmark. The source code is available at https://github.com/BahramMohammadi/ACK.
引用
收藏
页码:4269 / 4277
页数:9
相关论文
共 50 条
  • [1] Grounding commonsense knowledge in intelligent systems
    Daoutis, Marios
    Coradeshi, Silvia
    Loutfi, Amy
    JOURNAL OF AMBIENT INTELLIGENCE AND SMART ENVIRONMENTS, 2009, 1 (04) : 311 - 321
  • [2] Implicit knowledge-augmented prompting for commonsense explanation generation
    Ge, Yan
    Yu, Hai-Tao
    Lei, Chao
    Liu, Xin
    Jatowt, Adam
    Kim, Kyoung-sook
    Lynden, Steven
    Matono, Akiyoshi
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, : 3663 - 3698
  • [3] Retrieval-Augmented Knowledge Graph Reasoning for Commonsense Question Answering
    Sha, Yuchen
    Feng, Yujian
    He, Miao
    Liu, Shangdong
    Ji, Yimu
    MATHEMATICS, 2023, 11 (15)
  • [4] A Commonsense Knowledge-based Object Retrieval Approach for Virtual Reality
    Jiang, Haiyan
    Weng, Dongdong
    Dongye, Xiaonuo
    Zhang, Nan
    Le, Luo
    2023 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES ABSTRACTS AND WORKSHOPS, VRW, 2023, : 795 - 796
  • [5] Strategies to Leverage Foundational Model Knowledge in Object Affordance Grounding
    Rai, Arushi
    Buettner, Kyle
    Kovashka, Adriana
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, : 1714 - 1723
  • [6] KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning
    Liu, Ye
    Wan, Yao
    He, Lifang
    Peng, Hao
    Yu, Philip S.
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 6418 - 6425
  • [7] Commonsense Reasoning and Commonsense Knowledge in Artificial Intelligence
    Davis, Ernest
    Marcus, Gary
    COMMUNICATIONS OF THE ACM, 2015, 58 (09) : 92 - 103
  • [8] Dimensions of commonsense knowledge
    Ilievski, Filip
    Oltramari, Alessandro
    Ma, Kaixin
    Zhang, Bin
    McGuinness, Deborah L.
    Szekely, Pedro
    KNOWLEDGE-BASED SYSTEMS, 2021, 229
  • [9] THE COMMONSENSE OF OBJECT ORIENTED LANGUAGES
    FREEDMAN, RS
    COMPUTER DESIGN, 1983, 22 (02): : 111 - &
  • [10] Knowledge processing and commonsense
    Narasimhan, R
    KNOWLEDGE-BASED SYSTEMS, 1997, 10 (03) : 147 - 151