Variational Memory Encoder-Decoder

被引:0
|
作者
Hung Le [1 ]
Truyen Tran [1 ]
Thin Nguyen [1 ]
Venkatesh, Svetha [1 ]
机构
[1] Deakin Univ, Appl AI Inst, Geelong, Vic, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Introducing variability while maintaining coherence is a core task in learning to generate utterances in conversation. Standard neural encoder-decoder models and their extensions using conditional variational autoencoder often result in either trivial or digressive responses. To overcome this, we explore a novel approach that injects variability into neural encoder-decoder via the use of external memory as a mixture model, namely Variational Memory Encoder-Decoder (VMED). By associating each memory read with a mode in the latent mixture distribution at each timestep, our model can capture the variability observed in sequential data such as natural conversations. We empirically compare the proposed model against other recent approaches on various conversational datasets. The results show that VMED consistently achieves significant improvement over others in both metric-based and qualitative evaluations.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Explainable gait recognition with prototyping encoder-decoder
    Moon, Jucheol
    Shin, Yong-Min
    Park, Jin-Duk
    Minaya, Nelson Hebert
    Shin, Won-Yong
    Choi, Sang-Il
    [J]. PLOS ONE, 2022, 17 (03):
  • [22] Parallel encoder-decoder framework for image captioning
    Saeidimesineh, Reyhane
    Adibi, Peyman
    Karshenas, Hossein
    Darvishy, Alireza
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 282
  • [23] An encoder-decoder switch network for purchase prediction
    Park, Chanyoung
    Kim, Donghyun
    Yu, Hwanjo
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 185
  • [24] Exemplar Encoder-Decoder for Neural Conversation Generation
    Pandey, Gaurav
    Contractor, Danish
    Kumar, Vineet
    Joshi, Sachindra
    [J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 1329 - 1338
  • [25] On Mining Conditions using Encoder-decoder Networks
    Gallego, Fernando O.
    Corchuelo, Rafael
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 624 - 630
  • [26] Timber Tracing with Multimodal Encoder-Decoder Networks
    Zolotarev, Fedor
    Eerola, Tuomas
    Lensu, Lasse
    Kalviainen, Heikki
    Haario, Heikki
    Heikkinen, Jere
    Kauppi, Tomi
    [J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2019, PT II, 2019, 11679 : 342 - 353
  • [27] Attentive encoder-decoder networks for crowd counting
    Liu, Xuhui
    Hu, Yutao
    Zhang, Baochang
    Zhen, Xiantong
    Luo, Xiaoyan
    Cao, Xianbin
    [J]. NEUROCOMPUTING, 2022, 490 : 246 - 257
  • [28] Encoder-decoder network with RMP for tongue segmentation
    Worapan Kusakunniran
    Punyanuch Borwarnginn
    Sarattha Karnjanapreechakorn
    Kittikhun Thongkanchorn
    Panrasee Ritthipravat
    Pimchanok Tuakta
    Paitoon Benjapornlert
    [J]. Medical & Biological Engineering & Computing, 2023, 61 : 1193 - 1207
  • [29] Chaotic Encoder-Decoder on FPGA for Crypto System
    Roeksukrungrueang, Chanathip
    Dittaphong, Xaysamone
    Khongsomboon, Khamphong
    Panyanouyong, Nounchan
    Chivapreecha, Sorawat
    [J]. 2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [30] Molecular all-photonic encoder-decoder
    Andreasson, Joakim
    Straight, Stephen D.
    Moore, Thomas A.
    Moore, Ana L.
    Gust, Devens
    [J]. JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2008, 130 (33) : 11122 - 11128