Decomposition Based SAT Encodings for Itemset Mining Problems

Cited by: 10
Authors
Jabbour, Said [1]
Sais, Lakhdar [1]
Salhi, Yakoub [1]
Affiliation
[1] Univ Artois, CRIL CNRS, F-62307 Lens 3, France
Keywords
Declarative data mining; Itemset mining
DOI
10.1007/978-3-319-18032-8_52
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Recently, several encodings based on constraint programming (CP) and propositional satisfiability (SAT) have been proposed for various data mining problems, including itemset and sequence mining. This line of research allows data mining problems to be modeled declaratively while exploiting efficient, generic solving techniques. In practice, however, large datasets usually lead to constraint networks or Boolean formulas of huge size. Space complexity is clearly identified as the main bottleneck limiting the competitiveness of these new declarative and flexible models w.r.t. specialized data mining approaches. In this paper, we address this issue for SAT-based encodings of itemset mining problems. We propose a new encoding framework based on partitioning the transaction database. Experimental results on several well-known datasets show significant improvements, up to several orders of magnitude.
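The abstract does not spell out how the transaction database is partitioned. As a rough illustration only, the sketch below assumes one plausible scheme: for each item, keep just the transactions that contain it and search for frequent itemsets that include that item and none of the earlier ones. A brute-force enumeration stands in for the SAT solver, and all names (`cover`, `frequent_itemsets`, `decomposed`, `DB`) are hypothetical, not taken from the paper.

```python
from itertools import combinations


def cover(itemset, db):
    """Indices of the transactions that contain every item of `itemset`."""
    return [i for i, t in enumerate(db) if itemset <= t]


def frequent_itemsets(db, items, minsup, must_have=frozenset()):
    """Brute-force stand-in for enumerating the models of a SAT encoding.

    Returns every non-empty itemset over `items` that includes `must_have`
    and is contained in at least `minsup` transactions of `db`.
    """
    found = set()
    free = sorted(items - must_have)
    for k in range(len(free) + 1):
        for extra in combinations(free, k):
            candidate = frozenset(extra) | must_have
            if candidate and len(cover(candidate, db)) >= minsup:
                found.add(candidate)
    return found


def decomposed(db, minsup):
    """Assumed partitioning: one smaller sub-problem per item."""
    items = sorted(set().union(*db))
    result = set()
    for idx, item in enumerate(items):
        # Only transactions containing `item` can support these itemsets.
        sub_db = [t for t in db if item in t]
        # Earlier items are excluded so each itemset is found exactly once.
        allowed = frozenset(items[idx:])
        result |= frequent_itemsets(sub_db, allowed, minsup, frozenset([item]))
    return result


DB = [frozenset("abc"), frozenset("ab"), frozenset("ac"),
      frozenset("bc"), frozenset("a")]
ALL_ITEMS = frozenset("abc")

monolithic = frequent_itemsets(DB, ALL_ITEMS, minsup=2)
partitioned = decomposed(DB, minsup=2)
print(sorted("".join(sorted(s)) for s in partitioned))
```

Each sub-problem sees fewer transactions and fewer items than the full database, which is the kind of size reduction a partitioned encoding aims for; the two result sets are the same on this toy database.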
Pages: 662-674
Number of pages: 13