An evolutionary approach for efficient prototyping of large time series datasets

Cited by: 8
Authors
Leon-Alcaide, Pablo [1]
Rodriguez-Benitez, Luis [1]
Castillo-Herrera, Ester [1]
Moreno-Garcia, Juan [2]
Jimenez-Linares, Luis [1]
Affiliations
[1] Univ Castilla La Mancha, Escuela Super Informat, Dept Informat & Syst Technol, Paseo Univ S-N, Ciudad Real, Spain
[2] Univ Castilla La Mancha, Escuela Ingn Ind, Dept Informat & Syst Technol, Ave Carlos 3 S-N, Toledo, Spain
Keywords
Time series summarization; Genetic algorithms; Elastic distances; Data mining; AVERAGING METHOD; ALGORITHM; ALIGNMENT;
DOI
10.1016/j.ins.2019.09.044
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
We describe an algorithm based on an evolutionary strategy that finds the prototype series of a set of time series. It uses Dynamic Time Warping (DTW) as the distance measure between series and does not restrict the search space to the series in the set. The problem of calculating the centroid of a set of time series can be addressed as a minimization problem using genetic algorithms. Our proposal may be considered one of the non-classical approaches to genetic algorithms, where an individual gene is a candidate time series for being the centroid or representative of the whole set of series. The representation and operators of genetic algorithms are redesigned and, in order to generate efficient summaries, the fitness function of each candidate prototype is approximated by comparing it only with a randomly selected subset of the time series in the original dataset. Three aspects are examined to assess the quality of our proposal: the performance of the generated prototype in terms of a fitness function, the consistency of the prototype generation when used in classical clustering algorithms, and its use in classification algorithms based on the nearest prototypes. (C) 2019 Elsevier Inc. All rights reserved.
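The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`dtw`, `mean_dtw`, `evolve_prototype`), the parameters and their defaults, the truncation selection, the one-point crossover, and the Gaussian mutation are all assumptions chosen for brevity. The sketch also assumes every series has the same length (at least 2) and `pop_size` is at least 2, although DTW itself accepts unequal lengths. It keeps only what the abstract states: candidates are free-form series not restricted to the dataset, DTW is the distance, and fitness is approximated on a random subset of the data.

```python
import random


def dtw(a, b):
    """Unconstrained DTW distance with absolute difference as local cost."""
    inf = float("inf")
    prev = [0.0] + [inf] * len(b)  # row 0 of the cumulative-cost matrix
    for x in a:
        curr = [inf] * (len(b) + 1)
        for j, y in enumerate(b, start=1):
            curr[j] = abs(x - y) + min(prev[j], prev[j - 1], curr[j - 1])
        prev = curr
    return prev[-1]


def mean_dtw(candidate, series_list):
    """Average DTW distance from a candidate to every series in the list."""
    return sum(dtw(candidate, s) for s in series_list) / len(series_list)


def evolve_prototype(dataset, generations=30, pop_size=20, sample_size=5,
                     mutation_rate=0.1, mutation_scale=0.1, seed=0):
    """Hypothetical GA loop: all operators and defaults are assumptions."""
    rng = random.Random(seed)
    k = min(sample_size, len(dataset))
    # Start from copies of dataset series; later candidates may leave the set.
    population = [list(rng.choice(dataset)) for _ in range(pop_size)]
    for _ in range(generations):
        # Approximate fitness: compare only against a random subset.
        sample = rng.sample(dataset, k)
        population.sort(key=lambda c: mean_dtw(c, sample))
        survivors = population[: max(2, pop_size // 2)]  # truncation selection
        children = []
        while len(survivors) + len(children) < pop_size:
            p1, p2 = rng.sample(survivors, 2)
            cut = rng.randrange(1, len(p1))  # one-point crossover
            child = p1[:cut] + p2[cut:]
            # Gaussian mutation applied independently to each point.
            child = [v + rng.gauss(0.0, mutation_scale)
                     if rng.random() < mutation_rate else v
                     for v in child]
            children.append(child)
        population = survivors + children
    sample = rng.sample(dataset, k)
    return min(population, key=lambda c: mean_dtw(c, sample))
```

Calling `evolve_prototype(dataset)` on a list of equal-length numeric lists returns one candidate series. Because each generation scores candidates on a fresh random subset, the ranking is noisy; the sketch draws one shared subset per generation so that all candidates in that generation are compared on the same data.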
Pages: 74-93
Number of pages: 20
Related papers
50 in total
  • [1] Clustering of large time series datasets
    Aghabozorgi, Saeed
    Teh, Ying Wah
    [J]. INTELLIGENT DATA ANALYSIS, 2014, 18 (05) : 793 - 817
  • [2] TS-Evolutionary_Prototyping: A Python module for finding the prototype in large sets of time series
    Rodriguez-Benitez, Luis
    Leon-Alcaide, Pablo
    del Castillo, Ester
    Cabanero-Gomez, Luis
    Liu, Jun
    Jimenez-Linares, Luis
    [J]. SOFTWARE IMPACTS, 2023, 15
  • [3] FTSPlot: Fast Time Series Visualization for Large Datasets
    Riss, Michael
    [J]. PLOS ONE, 2014, 9 (04):
  • [4] An Evolutionary Approach for Modeling Time Series
    Bautu, Elena
    Bautu, Andrei
    Luchian, Henri
    [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, 2009, : 507 - +
  • [5] Efficient Optimization of Echo State Networks for Time Series Datasets
    Maat, Jacob Reinier
    Gianniotis, Nikos
    Protopapas, Pavlos
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018, : 102 - 108
  • [6] DDR: an index method for large time-series datasets
    An, JY
    Chen, YPP
    Chen, HX
    [J]. INFORMATION SYSTEMS, 2005, 30 (05) : 333 - 348
  • [7] A new evolutionary approach for time series forecasting
    Ferreira, Tiago A. E.
    Vasconcelos, Germano C.
    Adeodato, Paulo J. L.
    [J]. 2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 616 - 623
  • [8] Subsampling the Concurrent AdaBoost Algorithm: An Efficient Approach for Large Datasets
    Allende-Cid, Hector
    Acuna, Diego
    Allende, Hector
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2016, 2017, 10125 : 318 - 325
  • [9] A sampling-based approach for efficient clustering in large datasets
    Exarchakis, Georgios
    Oubari, Omar
    Lenz, Gregor
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12393 - 12402
  • [10] Automatic Versioning of Time Series Datasets: a FAIR Algorithmic Approach
    Gonzalez-Cebrian, Alba
    McGuinness, Luke A.
    Bradford, Michael
    Chis, Adriana E.
    Gonzalez-Velez, Horacio
    [J]. 2022 IEEE 18TH INTERNATIONAL CONFERENCE ON E-SCIENCE (ESCIENCE 2022), 2022, : 204 - 213