Mining Weighted Sequential Patterns Based on Prefix-Tree and Prism Encoding

被引:1
|
作者
Pham, Thi-Thiet [1 ]
Vu, Thuy-Duong [2 ]
Nguyen, Tai-Du [3 ]
Huynh, Bao [4 ]
Van, Trang [5 ]
机构
[1] Ind Univ Ho Chi Minh City, Fac Informat Technol, Ho Chi Minh City, Vietnam
[2] FPT Univ, Dept Comp Fundamental, Ho Chi Minh City, Vietnam
[3] UNI ASIA Co Ltd, Ho Chi Minh City, Vietnam
[4] HUTECH Univ, Fac Informat Technol, Ho Chi Minh City, Vietnam
[5] Ho Chi Minh City Univ Econ & Finance UEF, Fac Informat Technol, Ho Chi Minh City, Vietnam
关键词
Sequential pattern; weighted constraints; sequence database; prefix tree; EFFICIENT APPROACH;
D O I
10.1142/S219688882350001X
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of mining sequential patterns problem with weighted constraints is to find high-valued patterns, including infrequent patterns but having items which appear in the pattern of high importance in the sequence database (SD). Therefore, weighted sequential pattern mining will collect a set of more complete patterns with items of low support but of high importance. This paper proposes a new algorithm called WSPM_PreTree to find highly weighted sequential patterns. To collect a set of complete sequential patterns with the stricter weighted constraints of sequential patterns, the proposed algorithm uses both the minimum support constraint and the actual values of items appearing in the SD. To increase the performance of the finding weighted sequential patterns process, the algorithm uses the parent-child relationship on the prefix tree structure to create candidates and combines the weighted mean of the sequential 1-patterns that is calculated from the actual value of items in the SD as conditions to find the weighted sequential patterns. Experimental results show that the proposed algorithm is more efficient than sequential patterns mining with weight constraint (SPMW) algorithm [Ref. 20] in the runtime.
引用
收藏
页码:357 / 372
页数:16
相关论文
共 50 条
  • [1] Mining Sequential Rules Based on Prefix-Tree
    Thien-Trang Van
    Bay Vo
    Bac Le
    NEW CHALLENGES FOR INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2011, 351 : 147 - +
  • [2] Ascending frequency ordered prefix-tree: Efficient mining of frequent patterns
    Liu, GM
    Lu, HJ
    Xu, YB
    Yu, JX
    EIGHTH INTERNATIONAL CONFERENCE ON DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2003, : 65 - 72
  • [3] Efficient Mining of Frequent Patterns Using Ascending Frequency Ordered Prefix-Tree
    Guimei Liu
    Hongjun Lu
    Wenwu Lou
    Yabo Xu
    Jeffrey Xu Yu
    Data Mining and Knowledge Discovery, 2004, 9 (3) : 249 - 274
  • [4] Efficient Mining of Frequent Patterns Using Ascending Frequency Ordered Prefix-Tree
    Guimei Liu
    Hongjun Lu
    Wenwu Lou
    Yabo Xu
    Jeffrey Xu Yu
    Data Mining and Knowledge Discovery, 2004, 9 : 249 - 274
  • [5] Efficient mining of frequent patterns using ascending frequency ordered prefix-tree
    Liu, GM
    Lu, HJ
    Lou, WW
    DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 9 (03) : 249 - 274
  • [6] Incremental mining of sequential patterns using prefix tree
    Chen, Yue
    Guo, Jiankui
    Wang, Yaqin
    Xiong, Yun
    Zhu, Yangyong
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, 4426 : 433 - +
  • [7] Frequent Patterns Algorithm of Biological Sequences based on Pattern Prefix-tree
    Xue, L. Y.
    Zhang, X. K.
    Xie, F.
    Liu, S.
    Lin, P.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2019, 14 (04) : 574 - 589
  • [8] On a Parallel Spark Workflow for Frequent Itemset Mining Based on Array Prefix-Tree
    Niu, Xinzheng
    Qian, Mideng
    Wu, Chase Q.
    Hou, Aiqin
    PROCEEDINGS OF WORKS19: THE 2019 14TH IEEE/ACM WORKFLOWS IN SUPPORT OF LARGE-SCALE SCIENCE (WORKS), 2019, : 50 - 59
  • [9] On a parallel spark workflow for frequent itemset mining based on array prefix-tree
    Niu, Xinzheng
    Wu, Peng
    Wu, Chase Q.
    Hou, Aiqin
    Qian, Mideng
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (14):
  • [10] Efficient single-pass frequent pattern mining using a prefix-tree
    Tanbeer, Syed Khairuzzaman
    Ahmed, Chowdhury Farhan
    Jeong, Byeong-Soo
    Lee, Young-Koo
    INFORMATION SCIENCES, 2009, 179 (05) : 559 - 583