Open Domain Dialogue Generation with Latent Images

Cited by: 0
Authors
Yang, Ze [1]
Wu, Wei [2]
Hu, Huang [3]
Xu, Can [3]
Wang, Wei [4]
Li, Zhoujun [1]
Affiliations
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing, Peoples R China
[2] Meituan, Beijing, Peoples R China
[3] Microsoft, Beijing, Peoples R China
[4] China Resources Grp, Shenzhen, Peoples R China
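Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract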
We consider grounding open domain dialogues with images. Existing work assumes that both an image and a textual context are available, but image-grounded dialogues by nature are more difficult to obtain than textual dialogues. Thus, we propose learning a response generation model with both image-grounded dialogues and textual dialogues by assuming that the visual scene information at the time of a conversation can be represented by an image, and trying to recover the latent images of the textual dialogues through text-to-image generation techniques. The likelihood of the two types of dialogues is then formulated by a response generator and an image reconstructor that are learned within a conditional variational auto-encoding framework. Empirical studies are conducted in both image-grounded conversation and text-based conversation. In the first scenario, image-grounded dialogues, especially under a low-resource setting, can be effectively augmented by textual dialogues with latent images; while in the second scenario, latent images can enrich the content of responses and at the same time keep them relevant to contexts.
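As a rough illustration of the conditional variational auto-encoding idea mentioned in the abstract, the standard evidence lower bound for a textual dialogue with an unobserved image can be written as below. The symbols are assumptions introduced here, not notation taken from the paper: $C$ is the textual context, $Y$ the response, $z$ the latent image, $p_\theta(Y \mid C, z)$ a response generator, $p_\theta(z \mid C)$ a prior over images given the context, and $q_\phi(z \mid C, Y)$ an approximate posterior. The paper's actual objective may differ in its factorization and parameterization.

```latex
\log p_\theta(Y \mid C)
  \;\ge\;
  \mathbb{E}_{q_\phi(z \mid C, Y)}\bigl[\log p_\theta(Y \mid C, z)\bigr]
  - \mathrm{KL}\bigl(q_\phi(z \mid C, Y) \,\|\, p_\theta(z \mid C)\bigr)
```

Under the same assumed notation, when the image is observed, as in image-grounded dialogues, $z$ is given and the response term $\log p_\theta(Y \mid C, z)$ can be evaluated directly.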
Pages: 14239-14247
Number of pages: 9