Knowledge-aware Multimodal Dialogue Systems

被引:84
|
作者
Liao, Lizi [1 ]
Ma, Yunshan [1 ]
He, Xiangnan [1 ]
Hong, Richang [2 ]
Chua, Tat-Seng [1 ]
机构
[1] Natl Univ Singapore, Singapore, Singapore
[2] Hefei Univ Technol, Hefei, Peoples R China
基金
新加坡国家研究基金会; 美国国家科学基金会;
关键词
Multimodal Dialogue; Domain Knowledge; Fashion;
D O I
10.1145/3240508.3240605
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
By offering a natural way for information seeking, multimodal dialogue systems are attracting increasing attention in several domains such as retail, travel etc. However, most existing dialogue systems are limited to textual modality, which cannot be easily extended to capture the rich semantics in visual modality such as product images. For example, in fashion domain, the visual appearance of clothes and matching styles play a crucial role in understanding the user's intention. Without considering these, the dialogue agent may fail to generate desirable responses for users. In this paper, we present a Knowledge-aware Multimodal Dialogue (KMD) model to address the limitation of text-based dialogue systems. It gives special consideration to the semantics and domain knowledge revealed in visual content, and is featured with three key components. First, we build a taxonomy-based learning module to capture the fine-grained semantics in images (e.g., the category and attributes of a product). Second, we propose an end-to-end neural conversational model to generate responses based on the conversation history, visual semantics, and domain knowledge. Lastly, to avoid inconsistent dialogues, we adopt a deep reinforcement learning method which accounts for future rewards to optimize the neural conversational model. We perform extensive evaluation on a multi-turn task-oriented dialogue dataset in fashion domain. Experiment results show that our method significantly outperforms state-of-the-art methods, demonstrating the efficacy of modeling visual modality and domain knowledge for dialogue systems.
引用
收藏
页码:801 / 809
页数:9
相关论文
共 50 条
  • [1] Knowledge-aware Multimodal Fashion Chatbot
    Liao, Lizi
    Zhou, You
    Ma, Yunshan
    Hong, Richang
    Chua, Tat-Seng
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1265 - 1266
  • [2] Knowledge-aware and Conversational Recommender Systems
    Anelli, Vito Walter
    Basile, Pierpaolo
    Bridge, Derek
    Di Noia, Tommaso
    Lops, Pasquale
    Musto, Cataldo
    Narducci, Fedelucio
    Zanker, Markus
    [J]. 12TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS), 2018, : 521 - 522
  • [3] Accountable Knowledge-aware Recommender Systems
    Lops, Pasquale
    Musto, Cataldo
    Polignano, Marco
    [J]. 2023 PROCEEDINGS OF THE 31ST ACM CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, UMAP 2023, 2023, : 306 - 308
  • [4] Knowledge-aware Dialogue Generation with Hybrid Attention (Student Abstract)
    Zhao, Yaru
    Cheng, Bo
    Zhang, Yingying
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15951 - 15952
  • [5] Knowledge-aware Attentive Wasserstein Adversarial Dialogue Response Generation
    Zhang, Yingying
    Fang, Quan
    Qian, Shengsheng
    Xu, Changsheng
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2020, 11 (04)
  • [6] Video summarization via knowledge-aware multimodal deep networks
    Xie, Jiehang
    Chen, Xuanbai
    Zhao, Sicheng
    Lu, Shao-Ping
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 293
  • [7] Improving Knowledge-Aware Dialogue Generation via Knowledge Base Question Answering
    Wang, Jian
    Liu, Junhao
    Bi, Wei
    Liu, Xiaojiang
    He, Kejing
    Xu, Ruifeng
    Yang, Min
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9169 - 9176
  • [8] A survey on knowledge-aware news recommender systems
    Iana, Andreea
    Alam, Mehwish
    Paulheim, Heiko
    [J]. SEMANTIC WEB, 2024, 15 (01) : 21 - 82
  • [9] Generating Rational Commonsense Knowledge-Aware Dialogue Responses With Channel-Aware Knowledge Fusing Network
    Wu, Sixing
    Li, Ying
    Zhang, Dawei
    Wu, Zhonghai
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 3230 - 3239
  • [10] Knowledge-aware Autoencoders for Explainable Recommender Systems
    Bellini, Vito
    Schiavone, Angelo
    Di Noia, Tommaso
    Ragone, Azzurra
    Di Sciascio, Eugenio
    [J]. PROCEEDINGS OF THE 3RD WORKSHOP ON DEEP LEARNING FOR RECOMMENDER SYSTEMS (DLRS), 2018, : 24 - 31