Statistical Learning of Discrete States in Time Series

被引：26

作者：

Li, Hao ^{[1
]}

Yang, Haw ^{[1
]}

机构：

[1] Princeton Univ, Dept Chem, Princeton, NJ 08544 USA

来源：

JOURNAL OF PHYSICAL CHEMISTRY B | 2019年 / 123卷 / 03期

关键词：

CONTINUOUS STOCHASTIC-PROCESSES; SINGLE-MOLECULE; TRAJECTORY ENTROPY; FLUORESCENCE; INFORMATION; DYNAMICS; SPECTROSCOPY; DIFFUSION; PHOTON; RATES;

D O I：

10.1021/acs.jpcb.8b10561

中图分类号：

O64 [物理化学（理论化学）、化学物理学];

学科分类号：

070304 ; 081704 ;

摘要：

Time series obtained from time-dependent experiments contain rich information on kinetics and dynamics of the system under investigation. This work describes an unsupervised learning framework, along with the derivation of the necessary analytical expressions, for the analysis of Gaussian-distributed time series that exhibit discrete states. After the time series has been partitioned into segments in a model-free manner using the previously developed change-point (CP) method, this protocol starts with an agglomerative hierarchical clustering algorithm to classify the detected segments into possible states. The initial state clustering is further refined using an expectation-maximization (EM) procedure, and the number of states is determined by a Bayesian information criterion (BIC). Also introduced here is an achievement scalarization function, usually seen in artificial intelligence literature, for quantitatively assessing the performance of state determination. The statistical learning framework, which is comprised of three stages, detection of signal change, clustering, and number-of-state determination, was thoroughly characterized using simulated trajectories with random intensity segments that have no underlying kinetics, and its performance was critically evaluated. The application to experimental data is also demonstrated. The results suggested that this general framework, the implementation of which is based on firm theoretical foundations and does not require the imposition of any kinetics model, is powerful in determining the number of states, the parameters contained in each state, as well as the associated statistical significance.

引用

页码：689 / 701

页数：13

共 50 条

[41] A NOTE ON PREDICTION FOR DISCRETE TIME SERIES
Morvai, Gusztav
Weiss, Benjamin
KYBERNETIKA, 2012, 48 (04) : 809 - 823
[42] Time series with discrete semistable marginals
Nadjib Bouzar
K. Jayakumar
Statistical Papers, 2008, 49 : 619 - 635
[43] Comprehensive Prediction of Stock Prices Using Time Series, Statistical, Machine Learning, and Deep Learning Models
Sen, Jaydip
Kumar, Abhishek
Thomas, Aji
Todi, Nishant Kumar
Olemmyan, Olive
Tripathi, Swapnil
Arora, Vaibhav
TechRxiv, 2023,
[44] Learning Arbitrary Statistical Mixtures of Discrete Distributions
Li, Jian
Rabani, Yuval
Schulman, Leonard J.
Swamy, Chaitanya
STOC'15: PROCEEDINGS OF THE 2015 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2015, : 743 - 752
[45] Statistical Analysis of Discrete-valued Time Series by Parsimonious High-order Markov Chains
Kharin, Yuriy
AUSTRIAN JOURNAL OF STATISTICS, 2020, 49 (04) : 76 - 88
[46] STATISTICAL MODEL OF SOME TIME SERIES
PALMER, DS
NATURE, 1958, 181 (4624) : 1677 - 1677
[47] Statistical Characteristics of Nonstationary Time Series
Fei, Wanchun
Bai, Lun
INTERNATIONAL JOURNAL OF NONLINEAR SCIENCES AND NUMERICAL SIMULATION, 2010, 11 : 295 - 299
[48] STATISTICAL FORECASTING OF TELEPHONE TIME SERIES
TOMASEK, O
TELECOMMUNICATION JOURNAL, 1972, 39 (12): : 725 - &
[49] Statistical properties of financial time series
Krämer, W
JAHRBUCHER FUR NATIONALOKONOMIE UND STATISTIK, 2002, 222 (02): : 210 - 229
[50] STATISTICAL INFERENCE FOR FUNCTIONAL TIME SERIES
Li, Jie
Yang, Lijian
STATISTICA SINICA, 2023, 33 (01) : 519 - 549

← 1 2 3 4 5 →