Feature Representation and Similarity Measure Based on Covariance Sequence for Multivariate Time Series

被引:8
|
作者
Li, Hailin [1 ,2 ]
Lin, Chunpei [1 ]
Wan, Xiaoji [1 ]
Li, Zhengxin [3 ]
机构
[1] Huaqiao Univ, Coll Business Adm, Quanzhou 362021, Fujian, Peoples R China
[2] Huaqiao Univ, Res Ctr Appl Stat & Big Data, Xiamen 361021, Fujian, Peoples R China
[3] Air Force Engn Univ, Inst Equipment Management & Safety Engn, Xian 710051, Shaanxi, Peoples R China
来源
IEEE ACCESS | 2019年 / 7卷
基金
中国国家自然科学基金;
关键词
Multivariate time series; covariance matrix; principal component analysis; data mining; CLASSIFICATION;
D O I
10.1109/ACCESS.2019.2915602
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The high dimension of multivariate time series (MTS) is one of the major factors that impact on the efficiency and effectiveness of data mining. It has two kinds of dimensions, time-based dimensionality, and variable-based dimensionality. They often cause most of the algorithms and techniques applied to the field of MTS data mining to be a failure. In view of the importance of the correlation between any two variables in an MTS, the covariances between any two variables are applied to analyze the extraction of the features for every MTS. In this way, a covariance sequence can be constructed to represent the characteristic of the MTS. Furthermore, an excellent method of dimensionality reduction, principal component analysis (PCA), is used to extract the features of the covariance sequences that derived from an MTS dataset. Thus Euclidean distance is suitable to measure the similarity between the features fast. The experimental results demonstrate that the proposed method not only can handle multivariate time series with different lengths but also is more efficient and effective than the existing methods for the MTS data mining.
引用
下载
收藏
页码:67018 / 67026
页数:9
相关论文
共 50 条
  • [31] Trend and Value based Time Series Representation for Similarity Search
    Kane, Aminata
    2017 IEEE THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2017), 2017, : 252 - 259
  • [32] Time series classification with feature covariance matrices
    Hamza Ergezer
    Kemal Leblebicioğlu
    Knowledge and Information Systems, 2018, 55 : 695 - 718
  • [33] Time series classification with feature covariance matrices
    Ergezer, Hamza
    Leblebicioglu, Kemal
    KNOWLEDGE AND INFORMATION SYSTEMS, 2018, 55 (03) : 695 - 718
  • [34] Fuzzy clustering based on feature weights for multivariate time series
    Li, Hailin
    Wei, Miao
    KNOWLEDGE-BASED SYSTEMS, 2020, 197
  • [35] Energy Time Series Forecasting Based on Pattern Sequence Similarity
    Martinez-Alvarez, Francisco
    Troncoso, Alicia
    Riquelme, Jose C.
    Aguilar-Ruiz, Jesus S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (08) : 1230 - 1243
  • [36] Wind Power Forecasting Algorithm Based on Similarity of Multivariate Time Series
    Jin, Hui-Ying
    Yang, Yong-Qiang
    Wang, Zhan-Feng
    Ma, Wei-Jun
    Su, Yong
    Pan, Yun-Peng
    INTERNATIONAL CONFERENCE ON ENERGY DEVELOPMENT AND ENVIRONMENTAL PROTECTION (EDEP 2017), 2017, 168 : 77 - 84
  • [37] Surveillance of the covariance matrix of multivariate nonlinear time series
    Sliwa, P
    Schmid, W
    STATISTICS, 2005, 39 (03) : 221 - 246
  • [38] Inverse covariance operators of multivariate nonstationary time series
    Krampe, Jonas
    Rao, Suhasini subba
    BERNOULLI, 2024, 30 (02) : 1177 - 1196
  • [39] Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data
    Hallac, David
    Vare, Sagar
    Boyd, Stephen
    Leskovec, Jure
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5254 - 5258
  • [40] Robust estimation for the covariance matrix of multivariate time series based on normal mixtures
    Kim, Byungsoo
    Lee, Sangyeol
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 57 (01) : 125 - 140