An efficient similarity searching algorithm based on clustering for time series

被引:0
|
作者
Feng, Yucai [1 ]
Jiang, Tao [1 ]
Zhou, Yingbiao [1 ]
Li, Junkui [1 ]
机构
[1] Huazhong Univ Sci & Technol, Coll Comp Sci & Technol, Wuhan 430074, Peoples R China
关键词
time series; clustering; similarity search; indexing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Indexing large time series databases is crucial for efficient searching of time series queries. In the paper, we propose a novel indexing scheme RQI (Range Query based on Index) which includes three filtering methods: first-k filtering, indexing lower bounding and upper bounding as well as triangle inequality pruning. The basic idea is calculating wavelet coefficient whose first k coefficients are used to form a MBR. (minimal bounding rectangle) based on haar wavelet transform for each time series and then using point filtering method; At the same time, lower bounding and upper bounding feature of each time series is calculated, in advance, and stored into index structure. At last, triangle inequality pruning method is used by calculating the distance between time series beforehand. Then we introduce a novel lower bounding distance function SLBS (Symmetrical Lower Bounding based on Segment) and a novel clustering algorithm CSA (Clustering based on Segment Approximation) in order to further improve the search efficiency of point filtering method by keeping a good clustering trait of index structure. Extensive experiments over both synthetic and real datasets show that, our technologies provide perfect pruning power and could obtain an order of magnitude performance improvement for time series queries over traditional naive evaluation techniques.
引用
下载
收藏
页码:360 / 373
页数:14
相关论文
共 50 条
  • [41] Spider algorithm for clustering multivariate time series
    Department of Computational Intelligence and Systems Science, Tokyo Institute of Technology, 4259 Nagatsuta-cho, Midori-ku, Yokohama 226-8503, Japan
    WSEAS Trans. Inf. Sci. Appl., 2006, 3 (485-492):
  • [42] A new clustering algorithm for time series analysis
    Zeng, Jianping
    Guo, Donghui
    INTELLIGENT CONTROL AND AUTOMATION, 2006, 344 : 759 - 764
  • [43] SHAPE-BASED TIME SERIES SIMILARITY MEASURE AND PATTERN DISCOVERY ALGORITHM
    Zeng Fanzi Qiu Zhengding Li Dongsheng Yue Jianhai(Institute of Information and Science
    Journal of Electronics(China), 2005, (02) : 142 - 148
  • [44] Similarity search algorithm for multivariate time series based on empirical mode decomposition
    Wang, Yan
    Han, Meng
    Ma, Qianqian
    Journal of Computational Information Systems, 2014, 10 (08): : 3247 - 3254
  • [45] SHAPE-BASED TIME SERIES SIMILARITY MEASURE AND PATTERN DISCOVERY ALGORITHM
    Zeng Fanzi Qiu Zhengding Li Dongsheng Yue JianhaiInstitute of Information and Science Beijing Jiaotong University Beijing ChinaDongjian Hydropower Plant Hunan China
    Journal of Electronics, 2005, (02) : 142 - 148
  • [46] Clustering algorithm based on broad first searching neighbors
    Qian, Jiangbo
    Dong, Yisheng
    Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2004, 34 (01): : 109 - 112
  • [47] A Shape Based Similarity Measure for Time Series Classification with Weighted Dynamic Time Warping Algorithm
    Ye, Yanqing
    Niu, Caiyun
    Jiang, Jiang
    Ge, Bingfeng
    Yang, Kewei
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2017, : 104 - 109
  • [48] An Efficient Similarity Search For Financial Multivariate Time Series
    Zhou, Dazhuo
    Li, Minqiang
    Yan, Hongcan
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11161 - 11164
  • [49] An Effective and Efficient Similarity-Matrix-Based Algorithm for Clustering Big Mobile Social Data
    Bordogna, Gloria
    Frigerio, Luca
    Cuzzocrea, Alfredo
    Psaila, Giuseppe
    2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 514 - 521
  • [50] Underlying techniques of efficient similarity search on time series
    Feng, Yu-Cai
    Jiang, Tao
    Li, Guo-Hui
    Zhu, Hong
    Jisuanji Xuebao/Chinese Journal of Computers, 2009, 32 (11): : 2107 - 2122