Historic Moments Discovery in Sequence Data

Cited by: 1
Authors
Bai, Ran [1]
Hon, Wing Kai [2]
Lo, Eric [3]
He, Zhian [4]
Zhu, Kenny [5]
Affiliations
[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
[2] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan
[3] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[4] Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[5] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
Source
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2019, Vol. 44, No. 1
Keywords
Historic moments; space optimal; prominent streaks; sequence data; SKYLINE
DOI
10.1145/3276975
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Many emerging applications rely on finding interesting subsequences in sequence data. One such task that has received much attention is finding "prominent streaks," a set of the longest contiguous subsequences whose values are all above (or below) a certain threshold. Motivated by real applications, we observe that prominent streaks alone are not insightful enough and need to be accompanied by what we term "historic moments." In this article, we present an algorithm that efficiently computes historic moments from sequence data. The algorithm is incremental and space optimal: when new data arrives, it can efficiently refresh the results while keeping only minimal information. Case studies show that historic moments can significantly improve the insights offered by prominent streaks alone. Furthermore, experiments show that our algorithm can outperform the baseline in both time and space.
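To make the notion of a streak in the abstract concrete, here is a minimal Python sketch that finds the longest contiguous runs whose values all stay at or above a threshold. It is only an illustration of the abstract's informal description. It is not the article's algorithm, it is neither incremental nor space optimal, and it does not compute historic moments, which the abstract does not define. The function name `longest_streaks`, the inclusive `(start, end)` index convention, and the use of `>=` for "above" are assumptions made for this example.

```python
from typing import List, Sequence, Tuple


def longest_streaks(values: Sequence[float], threshold: float) -> List[Tuple[int, int]]:
    """Return inclusive (start, end) index pairs of the longest contiguous
    runs in which every value is >= threshold. All ties are reported.

    Hypothetical helper for illustration; not taken from the article.
    """
    best_len = 0
    best: List[Tuple[int, int]] = []
    start = None  # start index of the run currently being extended, if any
    n = len(values)

    # Iterate one step past the end so a run touching the last element is closed.
    for i in range(n + 1):
        in_run = i < n and values[i] >= threshold
        if in_run:
            if start is None:
                start = i
        elif start is not None:
            length = i - start
            if length > best_len:
                best_len, best = length, [(start, i - 1)]
            elif length == best_len:
                best.append((start, i - 1))
            start = None
    return best
```

For example, `longest_streaks([3, 5, 6, 1, 7, 8, 9, 2, 4, 4, 4], 4)` returns `[(4, 6), (8, 10)]`: two runs of length three tie for the longest, while the shorter run at indices 1 to 2 is dropped. Flipping the comparison gives the "below a threshold" variant mentioned in the abstract.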
Pages: 33