An efficient similarity searching algorithm based on clustering for time series

Cited by: 0
Authors
Feng, Yucai [1]
Jiang, Tao [1]
Zhou, Yingbiao [1]
Li, Junkui [1]
Affiliation
[1] Huazhong University of Science and Technology, College of Computer Science and Technology, Wuhan 430074, People's Republic of China
Keywords
time series; clustering; similarity search; indexing
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Indexing large time series databases is crucial for answering time series queries efficiently. In this paper, we propose a novel indexing scheme, RQI (Range Query based on Index), which combines three filtering methods: first-k filtering, index-based lower and upper bounding, and triangle inequality pruning. The basic idea is to compute the Haar wavelet transform of each time series, use its first k coefficients to form an MBR (minimal bounding rectangle), and then apply a point filtering method. At the same time, the lower-bounding and upper-bounding features of each time series are calculated in advance and stored in the index structure. Finally, triangle inequality pruning is applied using distances between time series that are calculated beforehand. We then introduce a novel lower-bounding distance function, SLBS (Symmetrical Lower Bounding based on Segment), and a novel clustering algorithm, CSA (Clustering based on Segment Approximation), to further improve the search efficiency of the point filtering method by preserving good clustering in the index structure. Extensive experiments on both synthetic and real datasets show that our techniques provide excellent pruning power and can achieve an order-of-magnitude performance improvement for time series queries over traditional naive evaluation techniques.
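The filtering ideas named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' RQI, SLBS, or CSA, whose details are not given in this record. It assumes Euclidean distance, series whose length is a power of two, an orthonormal Haar transform, and a single precomputed reference series for the triangle inequality step. The names `haar_transform`, `build_index`, `range_query`, and `pivot` are hypothetical and chosen only for illustration.

```python
import math


def haar_transform(series):
    """Orthonormal Haar transform; len(series) must be a power of two."""
    n = len(series)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    approx = [float(x) for x in series]
    details = []
    root2 = math.sqrt(2.0)
    while len(approx) > 1:
        pairs = range(0, len(approx), 2)
        avg = [(approx[i] + approx[i + 1]) / root2 for i in pairs]
        diff = [(approx[i] - approx[i + 1]) / root2 for i in pairs]
        details = diff + details  # coarser detail coefficients go first
        approx = avg
    return approx + details


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def build_index(database, k, pivot):
    """Precompute, per series, its first k Haar coefficients and its
    distance to the pivot (an assumed reference series)."""
    return [(haar_transform(s)[:k], euclidean(s, pivot)) for s in database]


def range_query(query, database, index, pivot, radius, k):
    """Return (matching indices, number of full distance computations)."""
    q_feat = haar_transform(query)[:k]
    q_to_pivot = euclidean(query, pivot)
    hits, full_checks = [], 0
    for i, (feat, s_to_pivot) in enumerate(index):
        # Triangle inequality: |d(q, p) - d(s, p)| <= d(q, s).
        if abs(q_to_pivot - s_to_pivot) > radius:
            continue
        # The transform is orthonormal, so the distance over the first k
        # coefficients never exceeds the true Euclidean distance.
        if euclidean(q_feat, feat) > radius:
            continue
        full_checks += 1
        if euclidean(query, database[i]) <= radius:
            hits.append(i)
    return hits, full_checks


database = [[1, 2, 3, 4], [1, 2, 3, 5], [10, 10, 10, 10], [0, 0, 0, 0]]
pivot = [0, 0, 0, 0]
index = build_index(database, k=2, pivot=pivot)
hits, full_checks = range_query([1, 2, 3, 4], database, index, pivot,
                                radius=1.5, k=2)
print(hits, full_checks)
```

Both filters only discard series that cannot lie within the query radius, so the result should match a brute-force scan while computing fewer full distances. A scheme like the one described would additionally group the k-dimensional feature points into MBRs inside a spatial index, which this sketch omits.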
Pages: 360-373
Number of pages: 14