Parallelization of similarity search in large time series databases

被引:0
|
作者
Qiao, Jonathan [1 ]
Ye, Yang [2 ]
Zhang, Chaoyang [2 ]
机构
[1] Converse Coll, Spartanburg, SC 29302 USA
[2] Univ South Mississippi, Hattiesburg 39401, MS USA
关键词
D O I
10.1109/IMSCCS.2006.100
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, an efficient parallel algorithm to search large time series databases is proposed. There are existing parallel algorithms for performing such tasks, which generally utilize multidimensional tree structures and thus are subjected to the performance of multidimensional trees. On the other hand, there have been a number of serial algorithms proposed in the past decade. Most of them use certain transformation techniques to reduce the dimensionality and then build an index to facilitate the search process. This again results in performance degradation. This work develops a parallel algorithm to process range query and k-nearest neighbor query in parallel time series databases, assuming a shared nothing multi-processor architecture. Both analytical and experimental results show that the new approach has near linear scaleup and linear speedup with little more effort than non-index based sequential scan and thus another alternative to index based approach.
引用
收藏
页码:355 / +
页数:2
相关论文
共 50 条
  • [31] MidiFind: Similarity Search and Popularity Mining in Large MIDI Databases
    Xia, Guangyu
    Huang, Tongbo
    Ma, Yifei
    Dannenberg, Roger
    Faloutsos, Christos
    [J]. SOUND, MUSIC, AND MOTION, 2014, 8905 : 259 - 276
  • [32] A fast heuristic algorithm for similarity search in large DNA databases
    Jeong, In-Seon
    Park, Kyoung-Wook
    Lim, Hyeong-Seok
    [J]. PROCEEDINGS OF THE FRONTIERS IN THE CONVERGENCE OF BIOSCIENCE AND INFORMATION TECHNOLOGIES, 2007, : 335 - 340
  • [33] Efficient similarity search in large databases of tree structured objects
    Kailing, K
    Kriegel, HP
    Schönauer, S
    Seidl, T
    [J]. 20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 835 - 835
  • [34] Speeding Up Similarity Search on a Large Time Series Dataset under Time Warping Distance
    Ruengronghirunya, Pongsakorn
    Niennattrakul, Vit
    Ratanamahatana, Chotirat Ann
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 981 - 988
  • [35] Probabilistic Similarity Search for Uncertain Time Series
    Assfalg, Johannes
    Kriegel, Hans-Peter
    Kroeger, Peer
    Benz, Matthias
    [J]. SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2009, 5566 : 435 - 443
  • [36] TimeExplorer: Similarity Search Time Series by Their Signatures
    Tuan Nhon Dang
    Wilkinson, Leland
    [J]. ADVANCES IN VISUAL COMPUTING, ISVC 2013, PT I, 2013, 8033 : 280 - 289
  • [37] Similarity search in trajectory Databases
    Pelekis, Nikos
    Kopanakis, Ioannis
    Marketos, Gerasimos
    Ntoutsi, Irene
    Andrienko, Gennady
    Theodoridis, Yannis
    [J]. TIME 2007: 14TH INTERNATIONAL SYMPOSIUM ON TEMPORAL REPRESENTATION AND REASONING, PROCEEDINGS, 2007, : 129 - +
  • [38] Similarity search in multimedia databases
    Keim, DA
    Bustos, B
    [J]. 20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 873 - 873
  • [39] EmbAssi: embedding assignment costs for similarity search in large graph databases
    Franka Bause
    Erich Schubert
    Nils M. Kriege
    [J]. Data Mining and Knowledge Discovery, 2022, 36 : 1728 - 1755
  • [40] Application of Kernel Functions for Accurate Similarity Search in Large Chemical Databases
    Wang, Xiaohong
    Huan, Jun
    Smalter, Aaron
    Lushington, Gerald H.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2009, : 356 - +