A Novel Parallel Scheme for Fast Similarity Search in Large Time Series

被引:9
|
作者
Yin Hong [1 ,3 ]
Yang Shuqiang [1 ]
Ma Shaodong [2 ]
Liu Fei [1 ]
Chen Zhikun [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha 410073, Hunan, Peoples R China
[2] Univ Hull, Sch Engn, Kingston Upon Hull HU6 7RX, N Humberside, England
[3] Xiangyang Sch NCOs, Xiangyang 441118, Peoples R China
基金
中国国家自然科学基金;
关键词
similarity; DTW; warping path; time series; MapReduce; parallelization; cluster;
D O I
10.1109/CC.2015.7084408
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
The similarity search is one of the fundamental components in time series data mining, e.g. clustering, classification, association rules mining. Many methods have been proposed to measure the similarity between time series, including Euclidean distance, Manhattan distance, and dynamic time warping (DTW). In contrast, DTW has been suggested to allow more robust similarity measure and be able to find the optimal alignment in time series. However, due to its quadratic time and space complexity, DTW is not suitable for large time series datasets. Many improving algorithms have been proposed for DTW search in large databases, such as approximate search or exact indexed search. Unlike the previous modified algorithm, this paper presents a novel parallel scheme for fast similarity search based on DTW, which is called MRDTW (MapRedcue-based DTW). The experimental results show that our approach not only retained the original accuracy as DTW, but also greatly improved the efficiency of similarity measure in large time series.
引用
收藏
页码:129 / 140
页数:12
相关论文
共 50 条
  • [1] A Novel Parallel Scheme for Fast Similarity Search in Large Time Series
    YIN Hong
    YANG Shuqiang
    MA Shaodong
    LIU Fei
    CHEN Zhikun
    China Communications, 2015, (02) : 129 - 140
  • [2] A Novel Parallel Scheme for Fast Similarity Search in Large Time Series
    YIN Hong
    YANG Shuqiang
    MA Shaodong
    LIU Fei
    CHEN Zhikun
    中国通信, 2015, 12 (02) : 129 - 140
  • [3] Indexing scheme for fast similarity search in large time series databases
    Keogh, Eamonn J.
    Pazzani, Michael J.
    Proceedings of the International Conference on Scientific and Statistical Database Management, SSDBM, 1999, : 56 - 67
  • [4] Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases
    Eamonn Keogh
    Kaushik Chakrabarti
    Michael Pazzani
    Sharad Mehrotra
    Knowledge and Information Systems, 2001, 3 (3) : 263 - 286
  • [5] Fast online similarity search for uncertain time series
    Ma R.
    Zheng D.
    Yan L.
    Journal of Computing and Information Technology, 2020, 28 (01): : 1 - 17
  • [6] A simple dimensionality reduction technique for fast similarity search in large time series databases
    Keogh, EJ
    Pazzani, MJ
    KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS: CURRENT ISSUES AND NEW APPLICATIONS, 2000, 1805 : 122 - 133
  • [7] Parallelization of similarity search in large time series databases
    Qiao, Jonathan
    Ye, Yang
    Zhang, Chaoyang
    FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 1, 2006, : 355 - +
  • [8] Histogram Distance for Similarity Search in Large Time Series Database
    Ouyang, Yicun
    Zhang, Feng
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2010, 2010, 6283 : 170 - 177
  • [9] Fast similarity search in the presence of longitudinal scaling in time series databases
    Keogh, E
    NINTH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1997, : 578 - 584
  • [10] Prefix Similarity Search in Time Series Databases and a Scheme for Its Efficient Evaluation
    Feng, Yaokai
    Kaneko, Kunihiko
    WMSCI 2008: 12TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL VI, PROCEEDINGS, 2008, : 144 - 149