Implementing Neural Turing Machines

被引:26
|
作者
Collier, Mark [1 ]
Beel, Joeran [1 ]
机构
[1] Trinity Coll Dublin, Dublin, Ireland
关键词
Neural Turing Machines; Memory Augmented; Neural Networks;
D O I
10.1007/978-3-030-01424-7_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural Turing Machines (NTMs) are an instance of Memory Augmented Neural Networks, a new class of recurrent neural networks which decouple computation from memory by introducing an external memory unit. NTMs have demonstrated superior performance over Long Short- Term Memory Cells in several sequence learning tasks. A number of open source implementations of NTMs exist but are unstable during training and/ or fail to replicate the reported performance of NTMs. This paper presents the details of our successful implementation of a NTM. Our implementation learns to solve three sequential learning tasks from the original NTM paper. We find that the choice of memory contents initialization scheme is crucial in successfully implementing a NTM. Networks with memory contents initialized to small constant values converge on average 2 times faster than the next best memory contents initialization scheme.
引用
收藏
页码:94 / 104
页数:11
相关论文
共 50 条
  • [21] Concurrent turing machines
    Farwer, Berndt
    Kudlek, Manfred
    Roelke, Heiko
    FUNDAMENTA INFORMATICAE, 2007, 79 (3-4) : 303 - 317
  • [22] Turing Machines with Atoms
    Bojanczyk, Mikolaj
    Klin, Bartek
    Lasota, Slawomir
    Torunczyk, Szymon
    2013 28TH ANNUAL IEEE/ACM SYMPOSIUM ON LOGIC IN COMPUTER SCIENCE (LICS), 2013, : 183 - 192
  • [23] Zigzags in Turing Machines
    Gajardo, Anahi
    Guillon, Pierre
    COMPUTER SCIENCE - THEORY AND APPLICATIONS, 2010, 6072 : 109 - +
  • [24] Structured Turing Machines
    L. P. Lisovik
    Cybernetics and Systems Analysis, 2004, 40 (2) : 162 - 168
  • [25] Graph Turing Machines
    Ackerman, Nathanael L.
    Freer, Cameron E.
    LOGIC, LANGUAGE, INFORMATION, AND COMPUTATION: 24TH INTERNATIONAL WORKSHOP, WOLLIC 2017, LONDON, UK, JULY 18-21, 2017, PROCEEDINGS, 2017, 10388 : 1 - 13
  • [26] Token Turing Machines
    Ryoo, Michael S.
    Gopalakrishnan, Keerthana
    Kahatapitiya, Kumara
    Xiao, Ted
    Rao, Kanishka
    Stone, Austin
    Lu, Yao
    Ibarz, Julian
    Arnab, Anurag
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19070 - 19081
  • [27] Noisy turing machines
    Asarin, E
    Collins, P
    AUTOMATA, LANGUAGES AND PROGRAMMING, PROCEEDINGS, 2005, 3580 : 1031 - 1042
  • [28] STRUCTURED TURING MACHINES
    PRATHER, RE
    INFORMATION AND CONTROL, 1977, 35 (02): : 159 - 171
  • [29] Agent-environment approach to the simulation of Turing Machines by Neural Networks
    de Oliveira, WR
    de Souto, MCP
    Ludermir, TB
    IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 71 - 76
  • [30] Involutory Turing Machines
    Nakano, Keisuke
    REVERSIBLE COMPUTATION (RC 2020), 2020, 12227 : 54 - 70