Statistical Learning of Discrete States in Time Series

被引:26
|
作者
Li, Hao [1 ]
Yang, Haw [1 ]
机构
[1] Princeton Univ, Dept Chem, Princeton, NJ 08544 USA
来源
JOURNAL OF PHYSICAL CHEMISTRY B | 2019年 / 123卷 / 03期
关键词
CONTINUOUS STOCHASTIC-PROCESSES; SINGLE-MOLECULE; TRAJECTORY ENTROPY; FLUORESCENCE; INFORMATION; DYNAMICS; SPECTROSCOPY; DIFFUSION; PHOTON; RATES;
D O I
10.1021/acs.jpcb.8b10561
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Time series obtained from time-dependent experiments contain rich information on kinetics and dynamics of the system under investigation. This work describes an unsupervised learning framework, along with the derivation of the necessary analytical expressions, for the analysis of Gaussian-distributed time series that exhibit discrete states. After the time series has been partitioned into segments in a model-free manner using the previously developed change-point (CP) method, this protocol starts with an agglomerative hierarchical clustering algorithm to classify the detected segments into possible states. The initial state clustering is further refined using an expectation-maximization (EM) procedure, and the number of states is determined by a Bayesian information criterion (BIC). Also introduced here is an achievement scalarization function, usually seen in artificial intelligence literature, for quantitatively assessing the performance of state determination. The statistical learning framework, which is comprised of three stages, detection of signal change, clustering, and number-of-state determination, was thoroughly characterized using simulated trajectories with random intensity segments that have no underlying kinetics, and its performance was critically evaluated. The application to experimental data is also demonstrated. The results suggested that this general framework, the implementation of which is based on firm theoretical foundations and does not require the imposition of any kinetics model, is powerful in determining the number of states, the parameters contained in each state, as well as the associated statistical significance.
引用
收藏
页码:689 / 701
页数:13
相关论文
共 50 条
  • [41] A NOTE ON PREDICTION FOR DISCRETE TIME SERIES
    Morvai, Gusztav
    Weiss, Benjamin
    KYBERNETIKA, 2012, 48 (04) : 809 - 823
  • [42] Time series with discrete semistable marginals
    Nadjib Bouzar
    K. Jayakumar
    Statistical Papers, 2008, 49 : 619 - 635
  • [43] Comprehensive Prediction of Stock Prices Using Time Series, Statistical, Machine Learning, and Deep Learning Models
    Sen, Jaydip
    Kumar, Abhishek
    Thomas, Aji
    Todi, Nishant Kumar
    Olemmyan, Olive
    Tripathi, Swapnil
    Arora, Vaibhav
    TechRxiv, 2023,
  • [44] Learning Arbitrary Statistical Mixtures of Discrete Distributions
    Li, Jian
    Rabani, Yuval
    Schulman, Leonard J.
    Swamy, Chaitanya
    STOC'15: PROCEEDINGS OF THE 2015 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2015, : 743 - 752
  • [46] STATISTICAL MODEL OF SOME TIME SERIES
    PALMER, DS
    NATURE, 1958, 181 (4624) : 1677 - 1677
  • [47] Statistical Characteristics of Nonstationary Time Series
    Fei, Wanchun
    Bai, Lun
    INTERNATIONAL JOURNAL OF NONLINEAR SCIENCES AND NUMERICAL SIMULATION, 2010, 11 : 295 - 299
  • [48] STATISTICAL FORECASTING OF TELEPHONE TIME SERIES
    TOMASEK, O
    TELECOMMUNICATION JOURNAL, 1972, 39 (12): : 725 - &
  • [49] Statistical properties of financial time series
    Krämer, W
    JAHRBUCHER FUR NATIONALOKONOMIE UND STATISTIK, 2002, 222 (02): : 210 - 229
  • [50] STATISTICAL INFERENCE FOR FUNCTIONAL TIME SERIES
    Li, Jie
    Yang, Lijian
    STATISTICA SINICA, 2023, 33 (01) : 519 - 549