tsflex: Flexible time series processing & feature extraction

被引:10
|
作者
Van der Donckt, Jonas [1 ]
Van der Donckt, Jeroen [1 ]
Deprost, Emiel [1 ]
Van Hoecke, Sofie [1 ]
机构
[1] Univ Ghent, IMEC, IDLab, Technol Pk Zwijnaarde 126, B-9052 Zwijnaarde, Belgium
基金
比利时弗兰德研究基金会;
关键词
Time series; Processing; Feature extraction; Machine learning; !text type='Python']Python[!/text;
D O I
10.1016/j.softx.2021.100971
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Time series processing and feature extraction are crucial and time-intensive steps in conventional machine learning pipelines. Existing packages are limited in their applicability, as they cannot cope with irregularly-sampled or asynchronous data and make strong assumptions about the data format. Moreover, these packages do not focus on execution speed and memory efficiency, resulting in considerable overhead. We present tsflex, a Python toolkit for time series processing and feature extraction, that focuses on performance and flexibility, enabling broad applicability. This toolkit leverages window-stride arguments of the same data type as the sequence-index, and maintains the sequence-index through all operations. tsflex is flexible as it supports (1) multivariate time series, (2) multiple window-stride configurations, and (3) integrates with processing and feature functions from other packages, while (4) making no assumptions about the data sampling regularity, series alignment, and data type. Other functionalities include multiprocessing, detailed execution logging, chunking sequences, and serialization. Benchmarks show that tsflex is faster and more memory-efficient compared to similar packages, while being more permissive and flexible in its utilization. (C) 2022 The Author(s). Published by Elsevier B.V.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Automatic feature extraction and selection for classification of cyclical time series data
    Schneider, Tizian
    Helwig, Nikolai
    Schuetze, Andreas
    TM-TECHNISCHES MESSEN, 2017, 84 (03) : 198 - 206
  • [32] Hierarchical Time Series Feature Extraction for Power Consumption Anomaly Detection
    Ouyang, Zhiyou
    Sun, Xiaokui
    Yue, Dong
    ADVANCED COMPUTATIONAL METHODS IN ENERGY, POWER, ELECTRIC VEHICLES, AND THEIR INTEGRATION, LSMS 2017, PT 3, 2017, 763 : 267 - 275
  • [33] Feature Extraction of Time series data for Wind Speed Power generation
    Khanna, Manju
    Srinath, N. K.
    Mendiratta, J. K.
    2016 IEEE 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (IACC), 2016, : 169 - 173
  • [34] Novel hidden feature extraction method for chaotic time series prediction
    Lei, Miao
    Peng, Yu
    Peng, Xiyuan
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2014, 35 (01): : 1 - 7
  • [35] Feature extraction from time-series data for process monitoring
    Fujiwara, T
    Nishitani, H
    KAGAKU KOGAKU RONBUNSHU, 1996, 22 (05) : 1103 - 1110
  • [36] FRUITS: feature extraction using iterated sums for time series classification
    Diehl, Joscha
    Krieg, Richard
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (06) : 4122 - 4156
  • [37] Under Sampling Adaboosting Shapelet Transformation for Time Series Feature Extraction
    Joo, Yohan
    Jeong, Jongpil
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2019, PT VI: 19TH INTERNATIONAL CONFERENCE, SAINT PETERSBURG, RUSSIA, JULY 14, 2019, PROCEEDINGS, PART VI, 2019, 11624 : 69 - 80
  • [38] Unsupervised feature extraction from multivariate time series for outlier detection
    Matsue, Kiyotaka
    Sugiyama, Mahito
    INTELLIGENT DATA ANALYSIS, 2022, 26 (06) : 1451 - 1467
  • [39] Feature extraction for time series classification using discriminating wavelet coefficients
    Zhang, Hui
    Ho, Tu Bao
    Lin, Mao-Song
    Liang, Xuefeng
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 1, 2006, 3971 : 1394 - 1399
  • [40] An improved feature extraction technique for high volume time series data
    Anstey, Jonathan S.
    Peters, Dennis K.
    Dawson, Chris
    PROCEEDINGS OF THE FOURTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PATTERN RECOGNITION, AND APPLICATIONS, 2007, : 74 - +