STAMP: On discovery of statistically important pattern repeats in long sequential data

被引:0
|
作者
Yang, J
Wang, W
Yu, PS
机构
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we focus on mining periodic patterns allowing some degree of imperfection in the form of random replacement from a perfect periodic pattern. In InfoMiner+, we proposed a new metric, namely generalized information gain, to identify patterns with events of vastly different occurrence frequencies and to adjust for the deviation from a pattern. In particular, a penalty is allowed to be associated with gaps between pattern occurrences. This is particularly useful in locating repeats in DNA sequences. In-this paper, we present an effective mining algorithm, STAMP, to simultaneously mine significant patterns and the associated subsequences under the model of generalized information gain.
引用
收藏
页码:224 / 235
页数:12
相关论文
共 50 条
  • [21] Discovery of dependency relations in sequential data flow
    Aldahami A.
    Li Y.
    Chan T.
    Aldahami, Abdulelah (abdulelah.aldahami@hdr.qut.edu.au), 1600, IOS Press BV (15): : 35 - 53
  • [22] Relation Discovery of Mobile Network Alarms with Sequential Pattern Mining
    Lozonavu, Mihaela
    Vlachou-Konchylaki, Martha
    Huang, Vincent
    2017 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2016, : 363 - 367
  • [23] Circadian Pattern Analysis of Psychophysiological Data: History Repeats Itself
    McGrath, Jennifer J.
    Idris, Muhammed Y.
    Kanji, Amyna
    PSYCHOSOMATIC MEDICINE, 2018, 80 (03): : A135 - A135
  • [24] CLOSED SEQUENTIAL PATTERN MINING IN BIOLOGICAL DATA
    Jawahar, S.
    Harishchander, A.
    Devaraju, S.
    Ali, S. Ahamed Johnsha
    Manivasagan, C.
    Sumathi, P.
    INTERNATIONAL JOURNAL OF LIFE SCIENCE AND PHARMA RESEARCH, 2020, : 9 - 13
  • [25] Automatic Sequential Pattern Mining in Data Streams
    Kawabata, Koki
    Matsubara, Yasuko
    Sakurai, Yasushi
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1733 - 1742
  • [26] PREDICTING SEQUENTIAL PATTERN CHANGES IN DATA STREAMS
    Li, I-Hui
    Huang, Jyun-Yao
    Liao, I-En
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (1A): : 285 - 302
  • [27] Data preprocessing by sequential pattern mining for LZW
    Vergara-Villegas, OO
    García-Hernández, RA
    Carrasco-Ochoa, JA
    Elías, RP
    Martínez-Trinidad, JF
    Sixth Mexican International Conference on Computer Science, Proceedings, 2005, : 82 - 87
  • [28] Sequential Pattern Mining from Stream Data
    Koper, Adam
    Hung Son Nguyen
    ADVANCED DATA MINING AND APPLICATIONS, PT II, 2011, 7121 : 278 - 291
  • [29] Leveraging Sequential Pattern Information for Active Learning from Sequential Data
    Fidalgo-Merino, Raul
    Gabrielli, Lorenzo
    Checchi, Enrico
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6957 - 6964
  • [30] Knowledge Discovery from Web Usage Data: Research and Development of Web Access Pattern Tree Based Sequential Pattern Mining Techniques: A Survey
    Shivaprasad, G.
    Subbareddy, N. V.
    Acharya, U. Dinesh
    INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN SCIENCE AND TECHNOLOGY (ICM2ST-10), 2010, 1324 : 319 - 323