End-To-End Memory Networks

被引:0
|
作者
Sukhbaatar, Sainbayar [1 ]
Szlam, Arthur [2 ]
Weston, Jason [2 ]
Fergus, Rob [2 ]
机构
[1] NYU, Courant Inst, Dept Comp Sci, New York, NY 10003 USA
[2] Facebook AI Res, New York, NY USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a neural network with a recurrent attention model over a possibly large external memory. The architecture is a form of Memory Network [23] but unlike the model in that work, it is trained end-to-end, and hence requires significantly less supervision during training, making it more generally applicable in realistic settings. It can also be seen as an extension of RNNsearch [2] to the case where multiple computational steps (hops) are performed per output symbol. The flexibility of the model allows us to apply it to tasks as diverse as (synthetic) question answering [22] and to language modeling. For the former our approach is competitive with Memory Networks, but with less supervision. For the latter, on the Penn TreeBank and Text8 datasets our approach demonstrates comparable performance to RNNs and LSTMs. In both cases we show that the key concept of multiple computational hops yields improved results.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Gated End-to-End Memory Networks
    Liu, Fei
    Perez, Julien
    [J]. 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, 2017, : 1 - 10
  • [2] SUPPORTIVE ATTENTION IN END-TO-END MEMORY NETWORKS
    Chien, Jen-Tzung
    Lin, Ting-An
    [J]. 2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [3] End-to-End Bayesian Networks Exact Learning in Shared Memory
    Karan, Subhadeep
    Sayed, Zainul Abideen
    Zola, Jaroslaw
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (04) : 634 - 645
  • [4] Convolutional End-to-End Memory Networks for Multi-Hop Reasoning
    Yang, Xiaoqing
    Fan, Pingzhi
    [J]. IEEE ACCESS, 2019, 7 : 135268 - 135276
  • [5] End-to-end over interplanetary networks
    Filman, RE
    [J]. IEEE INTERNET COMPUTING, 2003, 7 (05) : 4 - 5
  • [6] End-to-End Memory Behavior Profiling with DINAMITE
    Miucin, Svetozar
    Brady, Conor
    Fedorova, Alexandra
    [J]. FSE'16: PROCEEDINGS OF THE 2016 24TH ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON FOUNDATIONS OF SOFTWARE ENGINEERING, 2016, : 1042 - 1046
  • [7] Opportunistic Networks: end-to-end performance analysis)
    Santos, Rodrigo
    Orozco, Javier
    [J]. 2011 BRAZILIAN SYMPOSIUM ON COMPUTING SYSTEM ENGINEERING (SBESC), 2011, : 90 - 92
  • [8] End-to-End Delay in Wireless Random Networks
    Yu, Seung Min
    Kim, Seong-Lyun
    [J]. IEEE COMMUNICATIONS LETTERS, 2010, 14 (02) : 109 - 111
  • [9] Characterization of the End-to-End Delay in Heterogeneous Networks
    Fadhil, Diyar
    Oliveira, Rodolfo
    [J]. PROCEEDINGS OF THE 2021 12TH INTERNATIONAL CONFERENCE ON NETWORK OF THE FUTURE (NOF 2021), 2021,
  • [10] Forward End-to-End Delay for AFDX Networks
    Benammar, Nassima
    Ridouard, Frederic
    Bauer, Henri
    Richard, Pascal
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (03) : 858 - 865