Efficient subsequence matching using the Longest Common Subsequence with a Dual Match index

被引:0
|
作者
Han, Tae Sik
Ko, Seung-Kyu
Kang, Jaewoo [1 ,1 ]
机构
[1] Korea Univ, Dept Comp Sci & Engn, Seoul 136705, South Korea
关键词
subsequence matching; Longest Common Subsequence; Dual Match;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of subsequence matching is to find a query sequence from a long data sequence. Due to the abundance of applications, many solutions have been proposed. Virtually all previous solutions use the Euclidean measure as the basis for measuring distance between sequences. Recent studies, however, suggest that the Euclidean distance often fails to produce proper results due to the irregularity in the data, which is not so uncommon in our problem domain. Addressing this problem, some non-Euclidean measures, such as Dynamic Time Warping (DTW) and Longest Common Subsequence (LCS), have been proposed. However, most of the previous work in this direction focused on the whole sequence matching problem where query and data sequences are the same length. In this paper, we propose a novel subsequence matching framework using a non-Euclidean measure, in particular, LCS, and a new index query scheme. The proposed framework is based on the Dual Match framework where data sequences are divided into a series of disjoint equi-length subsequences and then indexed in an R-tree. We introduced similarity bound for index matching with LCS. The proposed query matching scheme reduces significant numbers of false positives in the match result. Furthermore, we developed an algorithm to skip expensive LCS computations through observing the warping paths. We validated our framework through extensive experiments using 48 different time series datasets. The results of the experiments suggest that our approach significantly improves the subsequence matching performance in various metrics.
引用
收藏
页码:585 / +
页数:2
相关论文
共 50 条
  • [1] Efficient Computation for the Longest Common Subsequence with Substring Inclusion and Subsequence Exclusion Constraints
    Wang, Xiaodong
    Zhu, Daxin
    [J]. SMART COMPUTING AND COMMUNICATION, SMARTCOM 2016, 2017, 10135 : 419 - 428
  • [2] Music Matching Based on Rough Longest Common Subsequence
    Lin, Hwei-Jen
    Wu, Hung-Hsuan
    Wang, Chun-Wei
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2011, 27 (01) : 95 - 110
  • [3] On longest matching consecutive subsequence
    Li, Jinjun
    Yang, Xiangfeng
    [J]. INTERNATIONAL JOURNAL OF NUMBER THEORY, 2019, 15 (08) : 1745 - 1758
  • [4] Palindromic Subsequence Automata and Longest Common Palindromic Subsequence
    Hasan M.M.
    Islam A.S.M.S.
    Rahman M.S.
    Sen A.
    [J]. Mathematics in Computer Science, 2017, 11 (2) : 219 - 232
  • [5] Cyclic longest common subsequence
    Naiman, Aaron E.
    Farber, Eliav
    Stein, Yossi
    [J]. DISCRETE MATHEMATICS ALGORITHMS AND APPLICATIONS, 2023, 15 (04)
  • [6] Exemplar longest common subsequence
    Bonizzoni, Paola
    Della Vedova, Gianluca
    Dondi, Riccardo
    Fertin, Guillaume
    Rizzi, Raffaella
    Vialette, Stephane
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2007, 4 (04) : 535 - 543
  • [7] On the longest common parameterized subsequence
    Keller, Orgad
    Kopelowitz, Tsvi
    Lewenstein, Moshe
    [J]. COMBINATORIAL PATTERN MATCHING, 2008, 5029 : 303 - +
  • [8] Efficient algorithms for finding a longest common increasing subsequence
    Chan, WT
    Zhang, Y
    Fung, SPY
    Ye, DS
    Zhu, H
    [J]. ALGORITHMS AND COMPUTATION, 2005, 3827 : 665 - 674
  • [9] On the longest common parameterized subsequence
    Keller, Orgad
    Kopelowitz, Tsvi
    Lewenstein, Moshe
    [J]. THEORETICAL COMPUTER SCIENCE, 2009, 410 (51) : 5347 - 5353
  • [10] An efficient algorithm for the longest common palindromic subsequence problem
    Liang, Ting-Wei
    Yang, Chang-Biau
    Huang, Kuo-Si
    [J]. THEORETICAL COMPUTER SCIENCE, 2022, 922 : 475 - 485