You should know more: Learning external knowledge for visual dialog

被引:3
|
作者
Zhao, Lei [1 ,2 ]
Zhang, Haonan [1 ]
Li, Xiangpeng [1 ]
Yang, Sen [1 ]
Song, Yuanfeng [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Peoples R China
[2] Sichuan Artificial Intelligence Res Inst, Yibin, Peoples R China
关键词
External knowledge; Graph convolutional network; Structured knowledge graph; Attention mechanism; Visual dialog;
D O I
10.1016/j.neucom.2021.10.121
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual dialog is a task that two agents complete a multi-round conversation based on an image, a caption, and dialog histories. Despite the recent progress, existing methods still undergo degradation on the condition of complex scenarios. Handling these scenarios depends on logical reasoning that requires commonsense priors. In this paper, we propose a novel visual dialog pipeline named Structured Knowledge-Aware Network (SKANet), consisting of an Image Knowledge-Aware Module and a Caption Knowledge-Aware Module. Specifically, the Image and Caption Knowledge-Aware Modules construct commonsense knowledge graphs from ConceptNet. We apply SKANet to two sub-tasks: the conventional visual dialog and a goal-oriented visual dialog named 'image guessing'. For the conventional visual dialog, the SKANet is combined with an additional Multi-Modality Fusion Module, which is designed to explore the visual content and the textual context about the dialog history. For the goal-oriented visual dialog, we directly apply the Image and Caption Knowledge-Aware Modules to two agents, respectively. Experimental results on VisDial v0.9 and VisDial v1.0 datasets show that our proposed method effectively outperforms comparative methods on both sub-tasks.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:54 / 65
页数:12
相关论文
共 50 条
  • [1] What you should know to survive in knowledge societies: On a semiotic understanding of 'knowledge'
    Hoffmann, MHG
    Roth, WM
    SEMIOTICA, 2005, 157 (1-4) : 105 - 142
  • [2] YOU SHOULD KNOW
    CHANDLER, HE
    METAL PROGRESS, 1979, 115 (04): : 11 - 11
  • [4] The More You Know: Using Knowledge Graphs for Image Classification
    Marino, Kenneth
    Salakhutdinov, Ruslan
    Gupta, Abhinav
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 20 - 28
  • [5] 'More then You Know'
    Harlan, M
    NEW YORK TIMES BOOK REVIEW, 2000, : 23 - 23
  • [6] The More You Know
    Lanzendorfer, Joy
    RARITAN-A QUARTERLY REVIEW, 2020, 39 (04): : 97 - 101
  • [7] Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer
    Kang, Gi-Cheon
    Park, Junseok
    Lee, Hwaran
    Zhang, Byoung-Tak
    Kim, Jin-Hwa
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 327 - 339
  • [8] Glyco you should know
    Massman, Lilyanna C.
    GLYCOBIOLOGY, 2024, 34 (11)
  • [9] Glyco You Should Know
    Kukan, Emily
    GLYCOBIOLOGY, 2024, 34 (05)
  • [10] 'Things You Should Know'
    Abell, S
    TLS-THE TIMES LITERARY SUPPLEMENT, 2003, (5224): : 21 - 21