A Novel Parallel Scheme for Fast Similarity Search in Large Time Series

Cited by: 9
Authors
Yin Hong [1 ,3 ]
Yang Shuqiang [1 ]
Ma Shaodong [2 ]
Liu Fei [1 ]
Chen Zhikun [1 ]
Affiliations
[1] Natl Univ Def Technol, Coll Comp, Changsha 410073, Hunan, Peoples R China
[2] Univ Hull, Sch Engn, Kingston Upon Hull HU6 7RX, N Humberside, England
[3] Xiangyang Sch NCOs, Xiangyang 441118, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
similarity; DTW; warping path; time series; MapReduce; parallelization; cluster;
DOI
10.1109/CC.2015.7084408
Chinese Library Classification
TN [Electronic technology, communication technology];
Discipline classification code
0809 ;
Abstract
Similarity search is one of the fundamental components of time series data mining tasks such as clustering, classification, and association rule mining. Many methods have been proposed to measure the similarity between time series, including Euclidean distance, Manhattan distance, and dynamic time warping (DTW). Among these, DTW provides a more robust similarity measure and can find the optimal alignment between time series. However, because of its quadratic time and space complexity, DTW is not suitable for large time series datasets. Many improved algorithms have been proposed for DTW search in large databases, such as approximate search and exact indexed search. Unlike these earlier modified algorithms, this paper presents a novel parallel scheme for fast DTW-based similarity search, called MRDTW (MapReduce-based DTW). The experimental results show that the approach not only retains the accuracy of the original DTW but also greatly improves the efficiency of similarity measurement on large time series.
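As a rough illustration of the ideas named in the abstract, the sketch below shows the classic DTW dynamic program (the source of the quadratic time and space cost) together with a map/reduce-style nearest-neighbour search over partitions of a dataset. It is not the authors' MRDTW implementation, which the abstract does not describe in detail. The names `dtw_distance`, `map_phase`, `reduce_phase`, and `nearest_series` are hypothetical, the absolute difference is an assumed local cost, and the "partitions" are plain in-memory lists standing in for the input splits of a real MapReduce cluster.

```python
from functools import reduce
import math


def dtw_distance(a, b):
    """Classic DTW with an absolute-difference local cost (an assumption).

    Fills an (n+1) x (m+1) table, hence O(n*m) time and space.
    """
    n, m = len(a), len(b)
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three admissible predecessor cells.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def map_phase(query, partition):
    """Map step: score every (series_id, series) pair in one partition."""
    return [(dtw_distance(query, series), series_id) for series_id, series in partition]


def reduce_phase(left, right):
    """Reduce step: keep the (distance, series_id) pair with the smaller distance."""
    return min(left, right)


def nearest_series(query, partitions):
    """Run the map step over each partition, then fold the results with the reduce step."""
    mapped = (pair for part in partitions for pair in map_phase(query, part))
    return reduce(reduce_phase, mapped)
```

Calling `nearest_series(query, partitions)` with a list of partitions, each a list of `(series_id, series)` pairs, returns the best `(distance, series_id)` pair. Because every partition is scored independently, the map step is the part that could be distributed across worker nodes. Each individual distance is still computed exactly, which is consistent with the abstract's claim that accuracy is retained.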
Pages: 129-140
Page count: 12
Related papers
50 in total
  • [21] Adaptive similarity search for the retrieval of rare events from large time series databases
    Schlegl, Thomas
    Schlegl, Stefan
    Tomaselli, Domenico
    West, Nikolai
    Deuse, Jochen
    ADVANCED ENGINEERING INFORMATICS, 2022, 52
  • [23] Weighted Hashing for Fast Large Scale Similarity Search
    Wang, Qifan
    Zhang, Dan
    Si, Luo
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1185 - 1188
  • [24] A novel indexing scheme for similarity search in metric spaces
    Tosun, Umut
    PATTERN RECOGNITION LETTERS, 2015, 54 : 69 - 74
  • [25] An Efficient Similarity Search For Financial Multivariate Time Series
    Zhou, Dazhuo
    Li, Minqiang
    Yan, Hongcan
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11161 - 11164
  • [26] Similarity search on time series based on threshold queries
    Assfalg, J
    Kriegel, HP
    Kröger, P
    Kunath, P
    Pryakhin, A
    Renz, M
    ADVANCES IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 3896 : 276 - 294
  • [27] GPU Acceleration of Similarity Search for Uncertain Time Series
    Hwang, Jun
    Kozawa, Yusuke
    Amagasa, Toshiyuki
    Kitagawa, Hiroyuki
    2014 17TH INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS (NBIS 2014), 2014, : 626 - 631
  • [28] Set-based Similarity Search for Time Series
    Peng, Jinglin
    Wang, Hongzhi
    Li, Jianzhong
    Gao, Hong
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 2039 - 2052
  • [30] Time Series Similarity Search Methods for Sensor Data
    Jawale, Anupama
    Magar, Ganesh
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2022, 56 (02) : 120 - 129