A discrete probabilistic memory model for discovering dependencies in time

Cited by: 0
Authors
Hochreiter, S [1]
Mozer, MC [1]
Affiliation
[1] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Many domains of machine learning involve discovering dependencies and structure over time. In the most complex of domains, long-term temporal dependencies are present. Neural network models such as LSTM have been developed to deal with long-term dependencies, but the continuous nature of neural networks is not well suited to discrete symbol processing tasks. Further, the mathematical underpinnings of neural networks are unclear, and gradient descent learning of recurrent neural networks seems particularly susceptible to local optima. We introduce a novel architecture for discovering dependencies in time. The architecture is formed by combining two variants of a hidden Markov model (HMM) - the factorial HMM and the input-output HMM - and adding a further strong constraint that requires the model to behave as a latch-and-store memory (the same constraint exploited in LSTM). This model, called an MIOFHMM, can learn structure that other variants of the HMM cannot, and can generalize better than LSTM on test sequences that have different statistical properties (different lengths, different types of noise) than training sequences. However, the MIOFHMM is slower to train and is more susceptible to local optima than LSTM.
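The latch-and-store constraint mentioned in the abstract can be illustrated with a small sketch. This is not the authors' MIOFHMM formulation (the abstract gives no equations); it is a minimal illustration, assuming each memory cell is a discrete state variable whose transition depends on the current input (as in an input-output HMM), and that several such cells run independently side by side (as in a factorial HMM). The names `latch_step`, `run_cell`, `run_factorial`, and the per-symbol `gate` table are hypothetical.

```python
from typing import Dict, List


def latch_step(belief: List[float], symbol: int, store_prob: float) -> List[float]:
    """One input-conditioned transition of a single latch-and-store cell.

    Assumption for illustration: with probability `store_prob` the cell
    overwrites its content with `symbol`; otherwise it keeps its current
    state (an identity transition), so stored content never decays.
    """
    return [
        (1.0 - store_prob) * b + (store_prob if state == symbol else 0.0)
        for state, b in enumerate(belief)
    ]


def run_cell(inputs: List[int], gate: Dict[int, float], n_states: int) -> List[float]:
    """Propagate a belief over the stored symbol through an input sequence.

    `gate` maps an input symbol to the probability that the cell latches it;
    symbols missing from `gate` are treated as distractors (probability 0).
    """
    belief = [1.0 / n_states] * n_states  # uniform prior over the stored symbol
    for x in inputs:
        belief = latch_step(belief, x, gate.get(x, 0.0))
    return belief


def run_factorial(
    inputs: List[int], gates: List[Dict[int, float]], n_states: int
) -> List[List[float]]:
    """Run several independent cells on the same inputs (factorial structure)."""
    return [run_cell(inputs, gate, n_states) for gate in gates]


if __name__ == "__main__":
    # Cell 0 latches symbols 0 and 1; cell 1 latches symbols 2 and 3.
    # Symbol 4 is a distractor that neither cell stores.
    gates = [{0: 1.0, 1: 1.0}, {2: 1.0, 3: 1.0}]
    for cell_belief in run_factorial([1, 3, 4, 4, 4], gates, n_states=5):
        print(cell_belief)
```

Because the non-storing transition is the identity, the belief about a stored symbol is unchanged by any number of intervening distractors, which is the property that lets a latch-style memory bridge long time lags. In a learned model the gate probabilities would be parameters to estimate rather than fixed by hand as they are here.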
Pages: 661-668
Page count: 8