Efficient mining of extraordinary patterns by pruning and predicting

被引:8
|
作者
Liu, Junqiang [1 ]
Chang, Zhongmin [1 ]
Leung, Carson K. S. [2 ]
Wong, Raymond C. W. [3 ]
Xu, Yabo [4 ]
Zhao, Rong [1 ]
机构
[1] Zhejiang Gongshang Univ, Sch Informat & Elect Engn, Hangzhou 310018, Zhejiang, Peoples R China
[2] Univ Manitoba, Dept Comp Sci, Winnipeg, MB R3T 2N2, Canada
[3] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China
[4] Guangzhou DataStory Informat Technol Ltd, Bldg A,100 Huangpu Dadao, Guangzhou, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Data mining; Pattern mining; Frequent patterns; High utility patterns; Extraordinary patterns; HIGH UTILITY ITEMSETS; ALGORITHMS;
D O I
10.1016/j.eswa.2019.01.079
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pattern mining is an important data mining technology. The existing pattern mining algorithms mainly focus on discovery of ordinary patterns in databases, for example, frequent pattern mining finds patterns with high frequencies and utility pattern mining discovers patterns with high utilities. However, in many real-world applications, people are more interested in finding extraordinary patterns with low frequencies and high utilities or with high frequencies and low utilities. While mining ordinary patterns is computationally hard, it is even harder to mine extraordinary patterns. In particular, a two-phase approach that first generates and materializes candidates (high-frequency patterns or high-utility patterns) and then finds extraordinary patterns from the candidates, suffers from the scalability and efficiency bottlenecks. This paper proposes an efficient algorithm for mining extraordinary patterns. The novelty of our algorithm lies in newly proposed lower bounds both on frequencies and on utilities of patterns, new pruning strategies and new predicting strategies for dramatically reducing the search space, and a novel data structure for efficient computation. The proposed algorithm employs a single-phase approach without materializing candidates, and also adapts the upper bounds on supports and on utilities for pruning. Extensive experiments show that the new pruning and predicting strategies are effective, and the proposed algorithm is efficient and scalable. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:55 / 68
页数:14
相关论文
共 50 条
  • [1] Mining frequent closed patterns by adaptive pruning
    Liu, Jun-Qiang
    Sun, Xiao-Ying
    Zhuang, Yue-Ting
    Pan, Yun-He
    [J]. Ruan Jian Xue Bao/Journal of Software, 2004, 15 (01): : 94 - 102
  • [2] A Novel Pruning Strategy for Mining Discriminative Patterns
    Aryabarzan, Nader
    Minaei-Bidgoli, Behrouz
    [J]. IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2021, 45 (02) : 505 - 527
  • [3] A Novel Pruning Strategy for Mining Discriminative Patterns
    Nader Aryabarzan
    Behrouz Minaei-Bidgoli
    [J]. Iranian Journal of Science and Technology, Transactions of Electrical Engineering, 2021, 45 : 505 - 527
  • [4] AN EFFICIENT GRADUAL PRUNING TECHNIQUE FOR UTILITY MINING
    Lan, Guo-Cheng
    Hong, Tzung-Pei
    Tseng, Vincent S.
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (7B): : 5165 - 5178
  • [5] An efficient pruning method for mining inter-sequence patterns based on pseudo-IDList
    Nguyen, Anh
    Nguyen, Ngoc Thanh
    Nguyen, Loan T. T.
    Vo, Bay
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [6] Efficient algorithms for simultaneously mining concise representations of sequential patterns based on extended pruning conditions
    Hai Duong
    Tin Truong
    Bac Le
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 67 : 197 - 210
  • [7] Efficient list based mining of high average utility patterns with maximum average pruning strategies
    Kim, Heonho
    Yun, Unil
    Baek, Yoonji
    Kim, Jongseong
    Vo, Bay
    Yoon, Eunchul
    Fujita, Hamido
    [J]. INFORMATION SCIENCES, 2021, 543 : 85 - 105
  • [8] An Efficient Pruning Approach for Class Association Rule Mining
    Nguyen, Loan T. T.
    Nguyen, Thang N.
    [J]. ADVANCES IN INTELLIGENT DECISION TECHNOLOGIES, 2010, 4 : 559 - +
  • [9] An efficient pruning approach for class association rule mining
    Nguyen L.T.T.
    Nguyen T.N.
    [J]. Smart Innovation, Systems and Technologies, 2010, 4 : 559 - 570
  • [10] Efficient algorithms for mining top-rank-k erasable patterns using pruning strategies and the subsume concept
    Tuong Le
    Bay Vo
    Baik, Sung Wook
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 68 : 1 - 9