Fast algorithm for high utility pattern mining with the sum of item quantities

被引:35
|
作者
Ryang, Heungmo [1 ]
Yun, Unil [1 ]
Ryu, Keun Ho [2 ]
机构
[1] Sejong Univ, Dept Comp Engn, Seoul, South Korea
[2] Chungbuk Natl Univ, Dept Comp Sci, Cheongju, South Korea
基金
新加坡国家研究基金会;
关键词
Data mining; high utility patterns; single-pass tree construction; tree restructuring; utility mining; FREQUENT ITEMSETS;
D O I
10.3233/IDA-160811
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In frequent pattern mining, items are considered as having the same importance in a database and their occurrence are represented as binary values in transactions. In real-world databases, however, items not only have relative importance but also are represented as non-binary values in transactions. High utility pattern mining is one of the most essential issues in the pattern mining field, which recently emerged to address the limitation of frequent pattern mining. Meanwhile, tree construction with a single database scan is significant since a database scan is a time-consuming task. In utility mining, an additional database scan is necessary to identify actual high utility patterns from candidates. In this paper, we propose a novel tree structure, namely SIQ-Tree (Sum of Item Quantities), which captures database information through a single-pass. Moreover, a restructuring method is suggested with strategies for reducing overestimated utilities. The proposed algorithm can construct the SIQ-Tree with only a single scan and decrease the number of candidate patterns effectively with the reduced overestimation utilities, through which mining performance is improved. Experimental results show that our algorithm outperforms a state-of-the-art one in terms of runtime and the number of generated candidates with a similar memory usage.
引用
收藏
页码:395 / 415
页数:21
相关论文
共 50 条
  • [21] UP-GNIV: An expeditious high utility pattern mining algorithm for itemsets with negative utility values
    Department of CSE, Coimbatore Institute of Engineering and Technology, Coimbatore, Tamil Nadu
    641109, India
    不详
    638401, India
    Int. J. Inf. Technol. Manage., 1 (26-42):
  • [22] A fast distributed mining algorithm for association rules with item constraints
    Wang, CH
    Huang, HK
    Li, HL
    SMC 2000 CONFERENCE PROCEEDINGS: 2000 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOL 1-5, 2000, : 1900 - 1905
  • [23] A Survey of Correlated High Utility Pattern Mining
    Almoqbily, Rashad S.
    Rauf, Azhar
    Quradaa, Fahmi H.
    IEEE ACCESS, 2021, 9 : 42786 - 42800
  • [24] ProUM: High Utility Sequential Pattern Mining
    Gan, Wensheng
    Lin, Jerry Chun-Wei
    Zhang, Jiexiong
    Chao, Han-Chieh
    Fujita, Hamido
    Yu, Philip S.
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 767 - 773
  • [25] On Incremental High Utility Sequential Pattern Mining
    Wang, Jun-Zhe
    Huang, Jiun-Long
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2018, 9 (05)
  • [26] Mining High Utility Sequential Patterns with Negative Item Values
    Xu, Tiantian
    Dong, Xiangjun
    Xu, Jianliang
    Dong, Xue
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (10)
  • [27] A New Algorithm for Mining High Utility Sequential Patterns Based on Pattern-growth
    基于模式增长的高效用序列模式挖掘算法
    Tang, Hui-Jun (totti_2018@sina.com), 1600, Science Press (47): : 943 - 954
  • [28] A Generalized Weighted Closed Sequential Pattern Mining Algorithm with Item Interval
    Lu, Haitao
    Li, Shuo
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 1599 - 1604
  • [29] A fast and highly scalable frequent pattern mining algorithm
    Cheng, Wan-Shu
    Lin, Yi-Ting
    Huang, Peng-Yu
    Chen, Ju-Chin
    Lin, Kawuu W.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 160 : 854 - 868
  • [30] Fast algorithm for mining item profit in retails based on microeconomic view
    Xu, XJ
    Jia, LF
    Zhe, W
    Zhang, HY
    Liang, S
    Zhou, CG
    2005 INTERNATIONAL CONFERENCE ON CYBERWORLDS, PROCEEDINGS, 2005, : 349 - 353