Efficient time-series subsequence matching using duality in constructing windows

被引:12
|
作者
Moon, YS [1 ]
Whang, KY
Loh, WK
机构
[1] Korea Adv Inst Sci & Technol, Dept Comp Sci, Taejon 305701, South Korea
[2] Korea Adv Inst Sci & Technol, Adv Informat Technol Res Ctr, Taejon 305701, South Korea
关键词
duality; data mining; subsequence matching; time-series data; similarity search;
D O I
10.1016/S0306-4379(01)00021-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a new subsequence matching method, Dual Match. Dual Match exploits duality in constructing windows and significantly improves performance. Dual Match divides data sequences into disjoint windows and the query sequence into sliding windows, and thus, is a dual approach of the one by Faloutsos et al. (Proceedings of the ACM SIGMOD International Conference on Management of Data, Seattle, Washington, 1994, pp. 419-429.) (FRM in short), which divides data sequences into sliding windows and the query sequence into disjoint windows. FRM causes a lot of false alarms (i.e., candidates that do not qualify) by storing minimum bounding rectangles rather than individual points representing windows to save storage space for the index. Dual Match solves this problem by directly storing points without incurring excessive storage overhead. Experimental results show that, in most cases, Dual Match provides large improvement both in false alarms and performance over FRM given the same amount of storage space. In particular, for low selectivities (less than 10(-4)), Dual Match significantly improves performance up to 430-fold. On the other hand, for high selectivities (more than 10(-2)), it shows a very minor degradation (less than 29%). For selectivities in between (10(-4)-10(-2)), Dual Match shows performance slightly better than that of FRM. Overall, these results indicate that our approach provides a new paradigm in subsequence matching that improves performance significantly in large database applications. (C) 2001 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:279 / 293
页数:15
相关论文
共 50 条
  • [1] Duality-based subsequence matching in time-series databases
    Moon, YS
    Whang, KY
    Loh, WK
    17TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2001, : 263 - 272
  • [2] Using multiple indexes for efficient subsequence matching in time-series databases
    Lim, Seung-Hwan
    Park, Heejin
    Kim, Sang-Wook
    INFORMATION SCIENCES, 2007, 177 (24) : 5691 - 5706
  • [3] Using multiple indexes for efficient subsequence matching in time-series databases
    Lim, Seung-Hwan
    Park, Hee-Jin
    Kim, Sang-Wook
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2006, 3882 : 65 - 79
  • [4] Efficient processing of subsequence matching with the Euclidean metric in time-series databases
    Kim, SW
    Park, DH
    Lee, HG
    INFORMATION PROCESSING LETTERS, 2004, 90 (05) : 253 - 260
  • [5] Linear Detrending Subsequence Matching in Time-Series Databases
    Gil, Myeong-Seon
    Moon, Yang-Sae
    Kim, Bum-Soo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (04) : 917 - 920
  • [6] Fast fuzzy subsequence matching algorithms on time-series
    Gong, Xueyuan
    Fong, Simon
    Si, Yain-Whar
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 116 : 275 - 284
  • [7] Quantizing time series for efficient subsequence matching
    Vega-Lopez, Ines F.
    Moon, Bongki
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON DATABASES AND APPLICATIONS, 2006, : 209 - +
  • [8] Efficient stream subsequence matching algorithms for handheld devices on streaming time-series data
    Moon, Yang-Sae
    Loh, Woong-Kee
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2009, 24 (04): : 273 - 284
  • [9] Efficient stream subsequence matching algorithms for handheld devices on streaming time-series data
    Moon, Yang-Sae
    Loh, Woong-Kee
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2009, 24 (02): : 115 - 127
  • [10] FAST SUBSEQUENCE MATCHING UNDER TIME WARPING IN TIME-SERIES DATABASES
    Liu, Xiao-Ying
    Ren, Chuan-Lun
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 1584 - 1590