STAMP: On discovery of statistically important pattern repeats in long sequential data

被引:0
|
作者
Yang, J
Wang, W
Yu, PS
机构
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we focus on mining periodic patterns allowing some degree of imperfection in the form of random replacement from a perfect periodic pattern. In InfoMiner+, we proposed a new metric, namely generalized information gain, to identify patterns with events of vastly different occurrence frequencies and to adjust for the deviation from a pattern. In particular, a penalty is allowed to be associated with gaps between pattern occurrences. This is particularly useful in locating repeats in DNA sequences. In-this paper, we present an effective mining algorithm, STAMP, to simultaneously mine significant patterns and the associated subsequences under the model of generalized information gain.
引用
收藏
页码:224 / 235
页数:12
相关论文
共 50 条
  • [1] Statistically Sound Pattern Discovery
    Hamalainen, Wilhelmiina
    Webb, Geoffrey, I
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 1976 - 1976
  • [2] A tutorial on statistically sound pattern discovery
    Hamalainen, Wilhelmiina
    Webb, Geoffrey I.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (02) : 325 - 377
  • [3] A tutorial on statistically sound pattern discovery
    Wilhelmiina Hämäläinen
    Geoffrey I. Webb
    Data Mining and Knowledge Discovery, 2019, 33 : 325 - 377
  • [4] Statistically-sound Knowledge Discovery from Data
    Riondato, Matteo
    PROCEEDINGS OF THE 2023 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2023, : 949 - 952
  • [5] Sequential Pattern Discovery for Weather Prediction Problem
    Alshareef, Almahdi
    Abu Bakar, Azuraliza
    Hamdan, Abdul Razak
    Abdullah, Sharifah Mastura Syed
    Jaafar, Othman
    EMERGING TRENDS AND ADVANCED TECHNOLOGIES FOR COMPUTATIONAL INTELLIGENCE, 2016, 647 : 223 - 240
  • [6] Distinguishing surgical behavior by sequential pattern discovery
    Huaulme, Arnaud
    Voros, Sandrine
    Riffaud, Laurent.
    Forestier, Germain
    Moreau-Gaudry, Alexandre
    Jannin, Pierre
    JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 67 : 34 - 41
  • [7] Automatic discovery of generalized sequential pattern in databases
    Ou-Yang, Weimin
    Cai, Qingsheng
    Ruan Jian Xue Bao/Journal of Software, 1997, 8 (11): : 864 - 870
  • [8] Explainable Long and Short-Term Pattern Detection in Projected Sequential Data
    Bittner, Matthias
    Hinterreiter, Andreas
    Eckelt, Klaus
    Streit, Marc
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2023, PT III, 2025, 2135 : 53 - 68
  • [9] Sequential Pattern Discovery Algorithm for Malaysia Rainfall Prediction
    Ahmed, A. M.
    Bakar, A. A.
    Hamdan, A. R.
    Abdullah, S. M. Syed
    Jaafar, O.
    ACTA PHYSICA POLONICA A, 2015, 128 (2B) : B324 - B326
  • [10] KAPPA AS A MEASURE OF PATTERN IN SEQUENTIAL DATA
    WAMPOLD, BE
    QUALITY & QUANTITY, 1989, 23 (02) : 171 - 187