Safe Pattern Pruning: An Efficient Approach for Predictive Pattern Mining

被引:22
|
作者
Nakagawa, Kazuya [1 ]
Suzumura, Shinya [1 ]
Karasuyama, Masayuki [1 ]
Tsuda, Koji [2 ]
Takeuchi, Ichiro [1 ]
机构
[1] Nagoya Inst Technol, Nagoya, Aichi, Japan
[2] Univ Tokyo, Tokyo, Japan
关键词
Predictive pattern mining; Graph mining; Item-set mining; Sparse learning; Safe screening; Convex optimization;
D O I
10.1145/2939672.2939844
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we study predictive pattern mining problems where the goal is to construct a predictive model based on a subset of predictive patterns in the database. Our main contribution is to introduce a novel method called safe pattern pruning (SPP) for a class of predictive pattern mining problems. The SPP method allows us to efficiently find a superset of all the predictive patterns in the database that are needed for the optimal predictive model. The advantage of the SPP method over existing boosting-type method is that the former can find the superset by a single search over the database, while the latter requires multiple searches. The SPP method is inspired by recent development of safe feature screening. In order to extend the idea of safe feature screening into predictive pattern mining, we derive a novel pruning rule called safe pattern pruning (SPP) rule that can be used for searching over the tree defined among patterns in the database. The SPP rule has a property that, if a node corresponding to a pattern in the database is pruned out by the SPP rule, then it is guaranteed that all the patterns corresponding to its descendant nodes are never needed for the optimal predictive model. We apply the SPP method to graph mining and item-set mining problems, and demonstrate its computational advantage.
引用
下载
收藏
页码:1785 / 1794
页数:10
相关论文
共 50 条
  • [41] An Efficient Close Frequent Pattern Mining Algorithm
    Tan, Jun
    Bu, Yingyong
    Yang, Bo
    ICICTA: 2009 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL I, PROCEEDINGS, 2009, : 528 - 531
  • [42] AN EFFICIENT ALGORITHM FOR MINING CAUSALITY LEAST PATTERN
    Abdullah, Zailani
    Deris, Mustafa Mat
    Jaafar, S. Zulaikha
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2009), VOLS 1 AND 2, 2009, : 1103 - 1110
  • [43] Efficient Failure Pattern Identification of Predictive Algorithms
    Nguyen, Bao
    Nguyen, Viet Anh
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 1534 - 1544
  • [44] A pruning pattern list approach to the permutation flowshop scheduling problem
    Yamada, T
    ESSAYS AND SURVEYS IN METAHEURISTICS, 2002, 15 : 641 - 651
  • [45] Prefix-Pruning-Based Distributed Frequent Trajectory Pattern Mining Algorithm
    Ding, Jiaman
    Li, Yunpeng
    Li, Ling
    Jia, Lianyin
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [46] WSpCPs: Weighted Sequential Pattern Mining based on Cluster-Pruning Mechanism
    Fu, Yu
    Yu, Yanhua
    Song, Meina
    2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PROBLEM-SOLVING (ICCP), 2013, : 291 - 294
  • [47] A pattern mining approach for information filtering systems
    Yuefeng Li
    Abdulmohsen Algarni
    Yue Xu
    Information Retrieval, 2011, 14 : 237 - 256
  • [48] A Study of the SEMINTEC Approach to Frequent Pattern Mining
    Jozefowska, Joanna
    Lawrynowicz, Agnieszka
    Lukaszewski, Tomasz
    KNOWLEDGE DISCOVERY ENHANCED WITH SEMANTIC AND SOCIAL INFORMATION, 2009, 220 : 37 - 51
  • [49] A pattern-growth approach for mining trajectories
    Khatir, Mohammed Rachid
    Lebbah, Yahia
    Nourine, Rachid
    MULTIAGENT AND GRID SYSTEMS, 2020, 16 (02) : 117 - 133
  • [50] An Improved Approach for Sequential Utility Pattern Mining
    Lan, Guo-Cheng
    Hong, Tzung-Pei
    Tseng, Vincent S.
    Wang, Shyue-Liang
    2012 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC 2012), 2012, : 226 - 230