PGLCM: efficient parallel mining of closed frequent gradual itemsets

被引:0
|
作者
Trong Dinh Thac Do
Alexandre Termier
Anne Laurent
Benjamin Negrevergne
Behrooz Omidvar-Tehrani
Sihem Amer-Yahia
机构
[1] University of Grenoble,LIG, CNRS UMR
[2] University of Montpellier II,LIRMM, CNRS UMR
[3] KU Leuven,Department of Computer Science
来源
关键词
Data mining; Frequent pattern mining; Gradual itemsets; Parallelism;
D O I
暂无
中图分类号
学科分类号
摘要
Numerical data (e.g., DNA micro-array data, sensor data) pose a challenging problem to existing frequent pattern mining methods which hardly handle them. In this framework, gradual patterns have been recently proposed to extract covariations of attributes, such as: “When X increases, Y decreases”. There exist some algorithms for mining frequent gradual patterns, but they cannot scale to real-world databases. We present in this paper GLCM, the first algorithm for mining closed frequent gradual patterns, which proposes strong complexity guarantees: the mining time is linear with the number of closed frequent gradual itemsets. Our experimental study shows that GLCM is two orders of magnitude faster than the state of the art, with a constant low memory usage. We also present PGLCM, a parallelization of GLCM capable of exploiting multicore processors, with good scale-up properties on complex datasets. These algorithms are the first algorithms capable of mining large real world datasets to discover gradual patterns.
引用
收藏
页码:497 / 527
页数:30
相关论文
共 50 条
  • [1] PGLCM: efficient parallel mining of closed frequent gradual itemsets
    Trong Dinh Thac Do
    Termier, Alexandre
    Laurent, Anne
    Negrevergne, Benjamin
    Omidvar-Tehrani, Behrooz
    Amer-Yahia, Sihem
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 43 (03) : 497 - 527
  • [2] An Efficient Algorithm for Mining Frequent Closed Itemsets
    Fang, Gang
    Wu, Yue
    Li, Ming
    Chen, Jia
    [J]. INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2015, 39 (01): : 87 - 98
  • [3] NUCLEAR: An Efficient Methods for Mining Frequent Itemsets and Generators from Closed Frequent Itemsets
    Huy Quang Pham
    Duc Tran
    Ninh Bao Duong
    Fournier-Viger, Philippe
    Alioune Ngom
    [J]. INFORMATION TECHNOLOGY IN INDUSTRY, 2019, 7 (02): : 1 - 13
  • [4] An Efficient Mining Model for Global Frequent Closed Itemsets
    Lin, Jianming
    Ju, Chunhua
    Liu, Dongsheng
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, VOL II, 2009, : 278 - 282
  • [5] An efficient algorithm for incrementally mining frequent closed itemsets
    Yen, Show-Jane
    Lee, Yue-Shi
    Wang, Chiu-Kuang
    [J]. APPLIED INTELLIGENCE, 2014, 40 (04) : 649 - 668
  • [6] Fast and memory efficient mining of frequent closed itemsets
    Lucchese, C
    Orlando, S
    Perego, R
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (01) : 21 - 36
  • [7] An efficient algorithm for incrementally mining frequent closed itemsets
    Show-Jane Yen
    Yue-Shi Lee
    Chiu-Kuang Wang
    [J]. Applied Intelligence, 2014, 40 : 649 - 668
  • [8] PNPFI: An Efficient Parallel Frequent Itemsets Mining Algorithm
    Zhang, Fang
    Zhang, Yu
    Liao, Xiaofei
    Jin, Hai
    [J]. PROCEEDINGS OF THE 2018 IEEE 22ND INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN ((CSCWD)), 2018, : 172 - 177
  • [9] An Efficient Algorithm for Mining Closed Frequent Itemsets in Data Streams
    Ao, Fujiang
    Du, Jing
    Yan, Yuejin
    Liu, Baohong
    Huang, Kedi
    [J]. 8TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY WORKSHOPS: CIT WORKSHOPS 2008, PROCEEDINGS, 2008, : 37 - +
  • [10] IFCIA: An efficient algorithm for mining intertransaction frequent closed itemsets
    Dong, Jie
    Han, Min
    [J]. FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2007, : 678 - +