Safe Pattern Pruning: An Efficient Approach for Predictive Pattern Mining

被引:22
|
作者
Nakagawa, Kazuya [1 ]
Suzumura, Shinya [1 ]
Karasuyama, Masayuki [1 ]
Tsuda, Koji [2 ]
Takeuchi, Ichiro [1 ]
机构
[1] Nagoya Inst Technol, Nagoya, Aichi, Japan
[2] Univ Tokyo, Tokyo, Japan
关键词
Predictive pattern mining; Graph mining; Item-set mining; Sparse learning; Safe screening; Convex optimization;
D O I
10.1145/2939672.2939844
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we study predictive pattern mining problems where the goal is to construct a predictive model based on a subset of predictive patterns in the database. Our main contribution is to introduce a novel method called safe pattern pruning (SPP) for a class of predictive pattern mining problems. The SPP method allows us to efficiently find a superset of all the predictive patterns in the database that are needed for the optimal predictive model. The advantage of the SPP method over existing boosting-type method is that the former can find the superset by a single search over the database, while the latter requires multiple searches. The SPP method is inspired by recent development of safe feature screening. In order to extend the idea of safe feature screening into predictive pattern mining, we derive a novel pruning rule called safe pattern pruning (SPP) rule that can be used for searching over the tree defined among patterns in the database. The SPP rule has a property that, if a node corresponding to a pattern in the database is pruned out by the SPP rule, then it is guaranteed that all the patterns corresponding to its descendant nodes are never needed for the optimal predictive model. We apply the SPP method to graph mining and item-set mining problems, and demonstrate its computational advantage.
引用
收藏
页码:1785 / 1794
页数:10
相关论文
共 50 条
  • [1] Efficient model selection for predictive pattern mining model by safe pattern pruning
    Yoshida, Takumi
    Hanada, Hiroyuki
    Nakagawa, Kazuya
    Taji, Kouichi
    Tsuda, Koji
    Takeuchi, Ichiro
    [J]. PATTERNS, 2023, 4 (12):
  • [2] An Efficient Candidate Pruning Technique for High Utility Pattern Mining
    Ahmed, Chowdhury Farhan
    Tanbeer, Syed Khairuzzaman
    Jeong, Byeong-Soo
    Lee, Young-Koo
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 749 - 756
  • [3] An Efficient Approach to Colossal Pattern Mining
    Dabbiru, Madhavi
    Shashi, Mogalla
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2010, 10 (01): : 304 - 312
  • [4] An Efficient Approach for Mining Sequential Pattern
    Pant, Nidhi
    Kant, Surya
    Pant, Bhaskar
    Sharma, Shashi Kumar
    [J]. PROCEEDINGS OF FIFTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2015), VOL 2, 2016, 437 : 587 - 596
  • [5] A PRUNING APPROACH TO PATTERN DISCOVERY
    Wang, Hsiao-Fan
    Chan, Zu-Wen
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2008, 7 (04) : 721 - 736
  • [6] Effective pruning strategies for sequential pattern mining
    Xu Yusheng
    Ma Zhixin
    Li Lian
    Dillon, Tharam S.
    [J]. FIRST INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, : 21 - +
  • [7] A Computationally Efficient Approach for Temporal Pattern Mining in IoT
    Radhakrishna, Vangipuram
    Kumar, Puligadda Veereswara
    Janaki, Vinjamuri
    Aljawarneh, Shadi
    [J]. 2016 INTERNATIONAL CONFERENCE ON ENGINEERING & MIS (ICEMIS), 2016,
  • [8] From sequential pattern mining to structured pattern mining: A pattern-growth approach
    Han, JW
    Pei, J
    Yan, XF
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2004, 19 (03) : 257 - 279
  • [9] From sequential pattern mining to structured pattern mining: A pattern-growth approach
    Jia-Wei Han
    Jian Pei
    Xi-Feng Yan
    [J]. Journal of Computer Science and Technology, 2004, 19 : 257 - 279
  • [10] A fast ensemble pruning algorithm based on pattern mining process
    Zhao, Qiang-Li
    Jiang, Yan-Huang
    Xu, Ming
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2009, 19 (02) : 277 - 292