Towards a Compact SAT-Based Encoding of Itemset Mining Tasks

Cited by: 1
Authors
Nekkache, Ikram [1, 2]
Jabbour, Said [1]
Sais, Lakhdar [1]
Kamel, Nadjet [2]
Affiliations
[1] Univ Artois, CRIL CNRS UMR 8188, Lens, France
[2] Univ Ferhat Abbas Setif 1, Fac Sci, Dept Comp Sci, LRSD Lab, Setif, Algeria
Keywords
Data mining; Itemset mining; Satisfiability; Constraint
DOI
10.1007/978-3-030-78230-6_11
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Many pattern mining tasks have been modeled and solved using constraint programming (CP) and propositional satisfiability (SAT). In these two well-known declarative AI models, the problem is encoded as a constraint network or a propositional formula whose models correspond to the patterns of interest. In this declarative framework, new user-specified constraints can be easily integrated, whereas in traditional data mining such additional constraints might require an implementation from scratch. Unfortunately, these declarative data mining approaches do not scale to large datasets, since they lead to very large encodings. In this paper, we propose a compact SAT-based encoding for itemset mining tasks, obtained by rewriting some key constraints. We prove that this reformulation can be expressed as a Boolean matrix compression problem. To address this problem, we propose a greedy approach that considerably reduces the size of the encoding while improving the pattern enumeration step. Finally, we provide experimental evidence that the proposed approach achieves a significant reduction in the size of the encoding. These results show notable improvements for this compact SAT-based itemset mining approach and significantly reduce the gap with the best state-of-the-art specialized algorithm.
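To make the idea of "models correspond to the patterns of interest" concrete, here is a minimal sketch of the baseline encoding commonly used in the SAT-based itemset mining literature. It is not the compact encoding proposed in the paper, which the abstract does not spell out. The sketch assumes one Boolean variable `p[a]` per item and one variable `q[i]` per transaction, a cover constraint fixing `q[i]` to true exactly when no selected item is missing from transaction `i`, and a frequency constraint requiring at least `min_support` true `q[i]`. The function name, the toy dataset, and the exclusion of the empty itemset are illustrative assumptions, and models are enumerated by brute force rather than with a SAT solver.

```python
from itertools import product

def frequent_itemsets_via_models(transactions, items, min_support):
    """Enumerate models of the cover + frequency constraints by brute force.

    Hypothetical helper for illustration only; a real system would hand
    the constraints to a SAT solver instead of trying every assignment.
    """
    items = sorted(items)
    found = []
    for p_bits in product([False, True], repeat=len(items)):
        p = dict(zip(items, p_bits))
        # Cover constraint: q_i <-> AND over items a not in T_i of (not p_a).
        # q is fully determined by p, so each p extends to at most one model.
        q = [all(not p[a] for a in items if a not in t) for t in transactions]
        # Frequency constraint: at least min_support transactions are covered.
        # The empty itemset is skipped here (an assumption of this sketch).
        if any(p_bits) and sum(q) >= min_support:
            found.append(frozenset(a for a in items if p[a]))
    return found

# Toy transaction database (made up for this example).
transactions = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}]
result = frequent_itemsets_via_models(transactions, {"a", "b", "c"}, 2)
```

In this baseline, every transaction contributes one cover constraint over all items it lacks, which hints at why the encoding size grows quickly with the dataset and why a more compact reformulation is of interest.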
Pages: 163-178
Page count: 16