Greedy Gaussian segmentation of multivariate time series

被引:1
|
作者
David Hallac
Peter Nystrup
Stephen Boyd
机构
[1] Stanford University,
[2] Technical University of Denmark,undefined
关键词
Time series analysis; Change-point detection; Financial regimes; Text segmentation; Covariance regularization; Greedy algorithms; 37M10: Time series analysis;
D O I
暂无
中图分类号
学科分类号
摘要
We consider the problem of breaking a multivariate (vector) time series into segments over which the data is well explained as independent samples from a Gaussian distribution. We formulate this as a covariance-regularized maximum likelihood problem, which can be reduced to a combinatorial optimization problem of searching over the possible breakpoints, or segment boundaries. This problem can be solved using dynamic programming, with complexity that grows with the square of the time series length. We propose a heuristic method that approximately solves the problem in linear time with respect to this length, and always yields a locally optimal choice, in the sense that no change of any one breakpoint improves the objective. Our method, which we call greedy Gaussian segmentation (GGS), easily scales to problems with vectors of dimension over 1000 and time series of arbitrary length. We discuss methods that can be used to validate such a model using data, and also to automatically choose appropriate values of the two hyperparameters in the method. Finally, we illustrate our GGS approach on financial time series and Wikipedia text data.
引用
收藏
页码:727 / 751
页数:24
相关论文
共 50 条
  • [1] Greedy Gaussian segmentation of multivariate time series
    Hallac, David
    Nystrup, Peter
    Boyd, Stephen
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (03) : 727 - 751
  • [2] Greedy copula segmentation of multivariate non-stationary time series for climate change adaptation
    Heo, Taemin
    Manuel, Lance
    [J]. PROGRESS IN DISASTER SCIENCE, 2022, 14
  • [3] SIMULATION OF MULTIVARIATE GAUSSIAN TIME-SERIES
    KROGSTAD, HE
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 1989, 18 (03) : 929 - 941
  • [4] Utilizing Multivariate Time Series for Semantic Segmentation
    van Leeuwen, Frederique
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 6125 - 6127
  • [5] Multivariate Segmentation of Time Series with Differential Evolution
    Graves, Daniel
    Pedrycz, Witold
    [J]. PROCEEDINGS OF THE JOINT 2009 INTERNATIONAL FUZZY SYSTEMS ASSOCIATION WORLD CONGRESS AND 2009 EUROPEAN SOCIETY OF FUZZY LOGIC AND TECHNOLOGY CONFERENCE, 2009, : 1108 - 1113
  • [6] A Segmentation Technology for Multivariate Contextual Time Series
    Zhang, Hui-Juan
    Huang, Jia-Cheng
    [J]. 2017 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI), 2017, : 71 - 74
  • [7] Inference of partial correlations of a multivariate Gaussian time series
    Dilernia, A. S.
    Fiecas, M.
    Zhang, L.
    [J]. BIOMETRIKA, 2024,
  • [8] Similarity Measure of Multivariate Time Series Based on Segmentation
    Li, Zhengxin
    Liu, Jia
    Zhang, Xiaofeng
    [J]. ICMLC 2020: 2020 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2018, : 47 - 51
  • [9] Segmentation of biological multivariate time-series data
    Nooshin Omranian
    Bernd Mueller-Roeber
    Zoran Nikoloski
    [J]. Scientific Reports, 5
  • [10] Memetic algorithm for multivariate time-series segmentation
    Lim, Hyunki
    Choi, Heeseung
    Choi, Yeji
    Kim, Ig-Jae
    [J]. PATTERN RECOGNITION LETTERS, 2020, 138 : 60 - 67