Convolutional State Space Models for Long-Range Spatiotemporal Modeling

Cited by: 0
Authors
Smith, Jimmy T. H. [2, 4]
De Mello, Shalini [1]
Kautz, Jan [1]
Linderman, Scott W. [3, 4]
Byeon, Wonmin [1]
Affiliations
[1] NVIDIA, Santa Clara, CA USA
[2] Stanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[4] Stanford Univ, Wu Tsai Neurosci Inst, Stanford, CA 94305 USA
Keywords
TIME;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Effectively modeling long spatiotemporal sequences is challenging due to the need to model complex spatial correlations and long-range temporal dependencies simultaneously. ConvLSTMs attempt to address this by updating tensor-valued states with recurrent neural networks, but their sequential computation makes them slow to train. In contrast, Transformers can process an entire spatiotemporal sequence, compressed into tokens, in parallel. However, the cost of attention scales quadratically in length, limiting their scalability to longer sequences. Here, we address the challenges of prior methods and introduce convolutional state space models (ConvSSM) that combine the tensor modeling ideas of ConvLSTM with the long sequence modeling approaches of state space methods such as S4 and S5. First, we demonstrate how parallel scans can be applied to convolutional recurrences to achieve subquadratic parallelization and fast autoregressive generation. We then establish an equivalence between the dynamics of ConvSSMs and SSMs, which motivates parameterization and initialization strategies for modeling long-range dependencies. The result is ConvS5, an efficient ConvSSM variant for long-range spatiotemporal modeling. ConvS5 significantly outperforms Transformers and ConvLSTM on a long horizon Moving-MNIST experiment while training 3x faster than ConvLSTM and generating samples 400x faster than Transformers. In addition, ConvS5 matches or exceeds the performance of state-of-the-art methods on challenging DMLab, Minecraft and Habitat prediction benchmarks and enables new directions for modeling long spatiotemporal sequences.
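The parallel-scan idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a scalar linear recurrence x_k = a_k * x_{k-1} + b_k with x_0 = 0 in place of a convolutional state update, and the names `combine`, `sequential_recurrence`, and `scan_recurrence` are hypothetical. The point it shows is that each step is an affine map, that composing affine maps is associative, and that all states can therefore be computed in a logarithmic number of rounds, each of which is parallelizable.

```python
def combine(left, right):
    """Compose two affine maps x -> a*x + b, applying `left` first."""
    a1, b1 = left
    a2, b2 = right
    return (a2 * a1, a2 * b1 + b2)


def sequential_recurrence(a, b, x0=0.0):
    """Reference: evaluate x_k = a_k * x_{k-1} + b_k one step at a time."""
    xs, x = [], x0
    for a_k, b_k in zip(a, b):
        x = a_k * x + b_k
        xs.append(x)
    return xs


def scan_recurrence(elems):
    """Inclusive prefix scan over (a_k, b_k) pairs by recursive doubling.

    Within one round every position depends only on the previous round,
    so a round could run in parallel; this list-based version just
    simulates that and needs about log2(L) rounds for L elements.
    """
    out = list(elems)
    shift = 1
    while shift < len(out):
        out = [
            out[i] if i < shift else combine(out[i - shift], out[i])
            for i in range(len(out))
        ]
        shift *= 2
    return out


# Illustrative values only (assumed, not taken from the paper).
a = [0.5, 0.9, 1.0, 0.2, 0.7]
b = [1.0, 2.0, -1.0, 0.5, 3.0]
# With x_0 = 0, the state x_k is the offset of the k-th composed map.
states = [b_cum for _, b_cum in scan_recurrence(list(zip(a, b)))]
```

This recursive-doubling variant performs O(L log L) work; work-efficient scans, such as the one exposed by `jax.lax.associative_scan`, use the same associative operator with O(L) work.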
Pages: 40