A discrete probabilistic memory model for discovering dependencies in time

被引:0
|
作者
Hochreiter, S [1 ]
Mozer, MC [1 ]
机构
[1] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many domains of machine learning involve discovering dependencies and structure over time. In the most complex of domains, long-term temporal dependencies are present. Neural network models such as LSTM have been developed to deal with long-term dependencies, but the continuous nature of neural networks is not well suited to discrete symbol processing tasks. Further, the mathematical underpinnings of neural networks are unclear, and gradient descent learning of recurrent neural networks seems particularly susceptible to local optima. We introduce a novel architecture for discovering dependencies in time. The architecture is formed by combining two variants of a hidden Markov model (HMM) - the factorial HMM and the input-output HMM - and adding a further strong constraint that requires the model to behave as a latch-and-store memory (the same constraint exploited in LSTM). This model, called an MIOFHMM, can learn structure that other variants of the HMM cannot, and can generalize better than LSTM on test sequences that have different statistical properties (different lengths, different types of noise) than training sequences. However, the MIOFHMM is slower to train and is more susceptible to local optima than LSTM.
引用
收藏
页码:661 / 668
页数:8
相关论文
共 50 条
  • [41] Synchronization for discrete-time complex networks with probabilistic time delays
    Cheng, Ranran
    Peng, Mingshu
    Yu, Jinchen
    Li, Haifen
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 525 : 1088 - 1101
  • [42] A Probabilistic Model for Discovering High Level Brain Activities from fMRI
    Li, Jun
    Tao, Dacheng
    [J]. NEURAL INFORMATION PROCESSING, PT I, 2011, 7062 : 329 - 336
  • [43] Discovering spatio-temporal dependencies based on time-lag in intelligent transportation data
    Zhou, Xiabing
    Hong, Haikun
    Xing, Xingxing
    Bian, Kaigui
    Xie, Kunqing
    Xu, Mingliang
    [J]. NEUROCOMPUTING, 2017, 259 : 76 - 84
  • [44] A Discrete-Time State Observer Approach to Discovering Portfolio Holdings
    Georges, D.
    Girerd-Potin, I.
    [J]. IFAC PAPERSONLINE, 2017, 50 (01): : 946 - 951
  • [45] Discovering operational signatures with time constraints from a discrete event sequence
    Bouché, P
    Le Goc, M
    [J]. HIS'04: FOURTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, PROCEEDINGS, 2005, : 55 - 60
  • [46] The membership problem for probabilistic and data dependencies
    Wong, SKM
    Butz, CJ
    [J]. TECHNOLOGIES FOR CONSTRUCTING INTELLIGENT SYSTEMS 2: TOOLS, 2002, 90 : 73 - 84
  • [47] PROBABILISTIC GRAPHICAL MODEL FOR FLASH MEMORY PROGRAMMING
    Peleato, Borja
    Agarwal, Rajiv
    Cioffi, John
    [J]. 2012 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2012, : 788 - 791
  • [48] Probabilistic memory model for visual images categorization
    Xiao L.
    Wang Y.
    Liu B.
    Liu W.
    [J]. Computing and Informatics, 2021, 39 (06) : 1229 - 1249
  • [49] Discovering context-aware conditional functional dependencies
    Yuefeng Du
    Derong Shen
    Tiezheng Nie
    Yue Kou
    Ge Yu
    [J]. Frontiers of Computer Science, 2017, 11 : 688 - 701
  • [50] A Probabilistic Model for Sign Language Translation Memory
    Othman, Achraf
    Jemni, Mohamed
    [J]. INTELLIGENT INFORMATICS, 2013, 182 : 317 - 324