Variational Memory Encoder-Decoder

被引:0
|
作者
Hung Le [1 ]
Truyen Tran [1 ]
Thin Nguyen [1 ]
Venkatesh, Svetha [1 ]
机构
[1] Deakin Univ, Appl AI Inst, Geelong, Vic, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Introducing variability while maintaining coherence is a core task in learning to generate utterances in conversation. Standard neural encoder-decoder models and their extensions using conditional variational autoencoder often result in either trivial or digressive responses. To overcome this, we explore a novel approach that injects variability into neural encoder-decoder via the use of external memory as a mixture model, namely Variational Memory Encoder-Decoder (VMED). By associating each memory read with a mode in the latent mixture distribution at each timestep, our model can capture the variability observed in sequential data such as natural conversations. We empirically compare the proposed model against other recent approaches on various conversational datasets. The results show that VMED consistently achieves significant improvement over others in both metric-based and qualitative evaluations.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Encoder-Decoder Optimization for Brain-Computer Interfaces
    Merel, Josh
    Pianto, Donald M.
    Cunningham, John P.
    Paninski, Liam
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (06)
  • [42] Ensemble Encoder-Decoder Models for Predicting Land Transformation
    Pourmohammadi, Pariya
    Strager, Michael P.
    Adjeroh, Donald A.
    [J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2021, 14 : 11429 - 11438
  • [43] SCNet: A Simplified Encoder-Decoder CNN for Semantic Segmentation
    Yasrab, Robail
    Gu, Naijie
    Zhang, Xiaoci
    [J]. PROCEEDINGS OF 2016 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2016, : 785 - 789
  • [44] Variational Encoder-Decoder Recurrent Neural Network (VED-RNN) for anomaly prediction in a host environment
    Bouzar-Benlabiod, Lydia
    Meziani, Lila
    Rubin, Stuart H.
    Belaidi, Kahina
    Haddar, Nour Elhouda
    [J]. 2019 IEEE 20TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2019), 2019, : 75 - 82
  • [45] An Improved and Robust Encoder-Decoder for Skin Lesion Segmentation
    Hafhouf, Bellal
    Zitouni, Athmane
    Megherbi, Ahmed Chaouki
    Sbaa, Salim
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) : 9861 - 9875
  • [46] KILM: Knowledge Injection into Encoder-Decoder Language Models
    Xu, Yan
    Namazifar, Mahdi
    Hazarika, Devamanyu
    Padmakumar, Aishwarya
    Liu, Yang
    Hakkani-Tur, Dilek
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 5013 - 5035
  • [47] Experimental Research on Encoder-Decoder Architectures with Attention for Chatbots
    Costa-jussa, Marta R.
    Nuez, Alvaro
    Segura, Carlos
    [J]. COMPUTACION Y SISTEMAS, 2018, 22 (04): : 1233 - 1239
  • [48] SPEECH-TO-SINGING CONVERSION IN AN ENCODER-DECODER FRAMEWORK
    Parekh, Jayneel
    Rao, Preeti
    Yang, Yi-Hsuan
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 261 - 265
  • [49] An Encoder-Decoder Approach to the Paradigm Cell Filling Problem
    Silfverberg, Miikka
    Hulden, Mans
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2883 - 2889
  • [50] An Improved Encoder-Decoder Network for Ore Image Segmentation
    Yang, Hao
    Huang, Chao
    Wang, Long
    Luo, Xiong
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (10) : 11469 - 11475