Towards a Compact SAT-Based Encoding of Itemset Mining Tasks

被引:1
|
作者
Nekkache, Ikram [1 ,2 ]
Jabbour, Said [1 ]
Sais, Lakhdar [1 ]
Kamel, Nadjet [2 ]
机构
[1] Univ Artois, CRIL CNRS UMR 8188, Lens, France
[2] Univ Ferhat Abbas Setif 1, Fac Sci, Dept Comp Sci, LRSD Lab, Setif, Algeria
关键词
Data mining; Itemset mining; Satisfiability; CONSTRAINT;
D O I
10.1007/978-3-030-78230-6_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many pattern mining tasks have been modeled and solved using constraints programming (CP) and propositional satisfiability (SAT). In these two well-known declarative AI models, the problem is encoded as a constraints network or a propositional formula, whose associated models correspond to the patterns of interest. In this new declarative framework, new user-specified constraints can be easily integrated, while in traditional data mining, such additional constraints might require an implementation from scratch. Unfortunately, these declarative data mining approaches do not scale on large datasets, leading to huge size encodings. In this paper, we propose a compact SAT-based encoding for itemset mining tasks, by rewriting some key-constraints. We prove that this reformulation can be expressed as a Boolean matrix compression problem. To address this problem, we propose a greedy approach allowing us to reduce considerably the size of the encoding while improving the pattern enumeration step. Finally, we provide experimental evidence that our proposed approach achieves a significant reduction in the size of the encoding. These results show interesting improvements of this compact SAT-based itemset mining approach while reducing significantly the gap with the best state-of-the-art specialized algorithm.
引用
收藏
页码:163 / 178
页数:16
相关论文
共 50 条
  • [31] SAT-Based verification of LTL formulas
    Zhang, Wenhui
    [J]. FORMAL METHODS: APPLICATIONS AND TECHNOLOGY, 2007, 4346 : 277 - 292
  • [32] A SAT-based algorithm for context matching
    Bouquet, P
    Magnini, B
    Serafini, L
    Zanobini, S
    [J]. MODELING AND USING CONTEXT, PROCEEDINGS, 2003, 2680 : 66 - 79
  • [33] The SAT-based Approach to Separation Logic
    Alessandro Armando
    Claudio Castellini
    Enrico Giunchiglia
    Marco Maratea
    [J]. Journal of Automated Reasoning, 2005, 35 : 237 - 263
  • [34] Comparison Between SAT-Based and CSP-Based Approaches to Resolve Pattern Mining Problems
    Rajeb, Akram
    Ben Hamadou, Abdelmajid
    Loukil, Zied
    [J]. HYBRID INTELLIGENT SYSTEMS, HIS 2015, 2016, 420 : 307 - 314
  • [35] SAT-Based ATL Satisfiability Checking
    Kacprzak, Magdalena
    Niewiadomski, Artur
    Penczek, Wojciech
    [J]. KR2020: PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PRINCIPLES OF KNOWLEDGE REPRESENTATION AND REASONING, 2020, : 539 - 549
  • [36] The SAT-based approach to separation logic
    Armando, Alessandro
    Castellini, Claudio
    Giunchiglia, Enrico
    Maratea, Marco
    [J]. JOURNAL OF AUTOMATED REASONING, 2005, 35 (1-3) : 237 - 263
  • [37] SAT-based summarization for boolean programs
    Basler, Gerard
    Kroening, Daniel
    Weissenbacher, Georg
    [J]. MODEL CHECKING SOFTWARE, PROCEEDINGS, 2007, 4595 : 131 - +
  • [38] SAT-based Analysis of Sensitisable Paths
    Sauer, Matthias
    Czutro, Alexander
    Schubert, Tobias
    Hillebrecht, Stefan
    Polian, Ilia
    Becker, Bernd
    [J]. 2011 IEEE 14TH INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS AND SYSTEMS (DDECS), 2011, : 93 - 98
  • [39] Interpolation and SAT-based model checking
    McMillan, KL
    [J]. COMPUTER AIDED VERIFICATION, 2003, 2725 : 1 - 13
  • [40] SAT-based analysis of cellular automata
    D'Antonio, M
    Delzanno, G
    [J]. CELLULAR AUTOMATA, PROCEEDINGS, 2004, 3305 : 745 - 754