An efficient method for time series similarity search using binary code representation and hamming distance

被引:7
|
作者
Zhang, Haowen [1 ]
Dong, Yabo [1 ]
Li, Jing [1 ]
Xu, Duanqing [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
关键词
Time series; similarity measure; binary code representation; Hamming Distance; APPROXIMATION;
D O I
10.3233/IDA-194876
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time series similarity search is an essential operation in time series data mining and has received much higher interest along with the growing popularity of time series data. Although many algorithms to solve this problem have been investigated, there is a challenging demand for supporting similarity search in a fast and accurate way. In this paper, we present a novel approach, TS2BC, to perform time series similarity search efficiently and effectively. TS2BC uses binary code to represent time series and measures the similarity under the Hamming Distance. Our method is able to represent original data compactly and can handle shifted time series and work with time series of different lengths. Moreover, it can be performed with reasonably low complexity due to the efficiency of calculating the Hamming Distance. We extensively compare TS2BC with state-of-the-art algorithms in classification framework using 61 online datasets. Experimental results show that TS2BC achieves better or comparative performance than other the state-of-the-art in accuracy and is much faster than most existing algorithms. Furthermore, we propose an approximate version of TS2BC to speed up the query procedure and test its efficiency by experiment.
引用
收藏
页码:439 / 461
页数:23
相关论文
共 50 条
  • [21] Improving Hamming-Distance Computation for Adaptive Similarity Search Approach
    Singh, Vikram
    Kumar, Chandradeep
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2022, 18 (02)
  • [22] Similarity search in time series databases using moments
    Toshniwal, D
    Joshi, RC
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA'04), 2004, : 164 - 171
  • [23] Iris Code Matching using Adaptive Hamming Distance
    Dehkordi, Arezou Banitalebi
    Abu-Bakar, Syed A. R.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS (ICSIPA), 2015, : 404 - 408
  • [24] An Enhanced Binary Symbolic Representation for Time Series Data Mining Based Similarity
    Sun, Meiyu
    Fang, Jianan
    [J]. 2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 7130 - 7134
  • [25] A novel bit level time series representation with implication of similarity search and clustering
    Ratanamahatana, C
    Keogh, E
    Bagnal, AJ
    Lonardi, S
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 771 - 777
  • [26] Similarity search based on shape representation in time-series data sets
    Jiang, Rong
    Li, Deyi
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2000, 37 (05): : 601 - 608
  • [27] Physical database design for efficient time-series similarity search
    Kim, Sang-Wook
    Kim, Jinho
    Park, Sanghyun
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2008, E91B (04) : 1251 - 1254
  • [28] An efficient image authentication method based on Hamming code
    Chan, Chi-Shiang
    Chang, Chin-Chen
    [J]. PATTERN RECOGNITION, 2007, 40 (02) : 681 - 690
  • [29] Similarity Search in Time Series Data Using Time Weighted Slopes
    Toshniwal, Durga
    Joshi, R. C.
    [J]. INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2005, 29 (01): : 79 - 88
  • [30] Speeding Up Similarity Search on a Large Time Series Dataset under Time Warping Distance
    Ruengronghirunya, Pongsakorn
    Niennattrakul, Vit
    Ratanamahatana, Chotirat Ann
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 981 - 988