Inference of stochastic time series with missing data

被引:3
|
作者
Lee, Sangwon [1 ,7 ]
Periwal, Vipul [2 ]
Jo, Junghyo [3 ,4 ,5 ,6 ]
机构
[1] Seoul Natl Univ, Dept Phys & Astron, Seoul 08826, South Korea
[2] NIDDK, Lab Biol Modeling, NIH, Bethesda, MD 20892 USA
[3] Seoul Natl Univ, Dept Phys Educ, Seoul 08826, South Korea
[4] Seoul Natl Univ, Ctr Theoret Phys, Seoul 08826, South Korea
[5] Seoul Natl Univ, Artificial Intelligence Inst, Seoul 08826, South Korea
[6] Korea Inst Adv Study, Sch Computat Sci, Seoul 02455, South Korea
[7] Univ Gottingen, Max Planck Sch Matter Life, D-37077 Gottingen, Germany
基金
新加坡国家研究基金会; 美国国家卫生研究院;
关键词
MEAN-FIELD THEORY; MODEL;
D O I
10.1103/PhysRevE.104.024119
中图分类号
O35 [流体力学]; O53 [等离子体物理学];
学科分类号
070204 ; 080103 ; 080704 ;
摘要
Inferring dynamics from time series is an important objective in data analysis. In particular, it is challenging to infer stochastic dynamics given incomplete data. We propose an expectation maximization (EM) algorithm that iterates between alternating two steps: E-step restores missing data points, while M-step infers an underlying network model from the restored data. Using synthetic data of a kinetic Ising model, we confirm that the algorithm works for restoring missing data points as well as inferring the underlying model. At the initial iteration of the EM algorithm, the model inference shows better model-data consistency with observed data points than with missing data points. As we keep iterating, however, missing data points show better model-data consistency. We find that demanding equal consistency of observed and missing data points provides an effective stopping criterion for the iteration to prevent going beyond the most accurate model inference. Using the EM algorithm and the stopping criterion together, we infer missing data points from a time-series data of real neuronal activities. Our method reproduces collective properties of neuronal activities such as correlations and firing statistics even when 70% of data points are masked as missing points.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] HETEROSKEDASTICITY AUTOCORRELATION ROBUST INFERENCE IN TIME SERIES REGRESSIONS WITH MISSING DATA
    Rho, Seung-Hwa
    Vogelsang, Timothy J.
    [J]. ECONOMETRIC THEORY, 2019, 35 (03) : 601 - 629
  • [2] Time series analysis on precipitation with missing data using stochastic SARIMA
    Sharma, M. K.
    Omer, Mohammed
    Kiani, Sara
    [J]. MAUSAM, 2020, 71 (04): : 617 - 624
  • [3] Inference for time series and stochastic processes
    Chan, Ngai Hang
    [J]. STATISTICA SINICA, 2006, 16 (03) : 683 - 696
  • [4] Time Series Forecasting with Missing Data Using Generative Adversarial Networks and Bayesian Inference
    Li, Xiaoou
    [J]. INFORMATION, 2024, 15 (04)
  • [5] Nonstationary Time Series and Missing Data
    Shin, Dong Wan
    Lee, Oesook
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2010, 23 (01) : 73 - 79
  • [6] Stochastic Variational Inference for Bayesian Time Series Models
    Johnson, Matthew James
    Willsky, Alan S.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 1854 - 1862
  • [7] Spectral modeling of time series with missing data
    Rodrigues, Paulo C.
    de Carvalho, Miguel
    [J]. APPLIED MATHEMATICAL MODELLING, 2013, 37 (07) : 4676 - 4684
  • [8] Online Time Series Prediction with Missing Data
    Anava, Oren
    Hazan, Elad
    Zeevi, Assaf
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 2191 - 2199
  • [9] EM-Based Inference for Cyclostationary Time Series with Missing Observations
    Drake, Christiana
    Knapik, Oskar
    Leskow, Jacek
    [J]. CYCLOSTATIONARITY: THEORY AND METHODS, 2014, : 23 - 35
  • [10] Nonparametric inference for unbalanced time series data
    Linton, O
    [J]. ECONOMETRIC THEORY, 2005, 21 (01) : 143 - 157