STARM: STreaming Association Rules Mining in High-Dimensional Data

被引:0
|
作者
Gahar, Rania Mkhinini [1 ]
Arfaoui, Olfa [2 ]
Hidri, Adel [3 ]
Alsaif, Suleiman Ali [3 ]
Hidri, Minyar Sassi [3 ]
机构
[1] Univ Tunis El Manar, Natl Engn Sch Tunis, OASIS Res Lab, Tunis, Tunisia
[2] Univ Tunis El Manar, Natl Engn Sch Tunis, RISC Res Lab, Tunis, Tunisia
[3] Imam Abdulrahman Bin Faisal Univ, Dept Comp, Deanship Preparatory Year & Supporting Studies, Dammam, Saudi Arabia
关键词
Association Rules; Dimensionality Reduction; Spark Streaming; Apriori; Sliding Window;
D O I
10.1007/978-3-031-57853-3_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predictive analytics involves using Data Mining algorithms to discover knowledge from large databases. The Association Rules (ARs) mining technique is considered to be one of the most prevalent data mining techniques in this context. When it comes to Big Data, we talk about data stream mining which is the process of extracting knowledge from continuous data streams. In this paper, STARM (STreaming Association Rules Mining) is proposed as an efficient and distributed algorithm for mining ARs. Based on the transaction-sensitive sliding-window model, the Apriori algorithm is applied to data streams to extract frequent itemsets (FI) that are then generated into ARs via Spark streaming framework. A Dimensionality Reduction (DR) step takes place as a data preprocessing step that may reduce the search space. The conducted experiments show that the proposed streaming model achieves state-of-the-art performance.
引用
收藏
页码:136 / 146
页数:11
相关论文
共 50 条
  • [41] Association Rules Mining on Retail Data
    Dagaslani, Hatice
    Basar, Ozlem Deniz
    EKOIST-JOURNAL OF ECONOMETRICS AND STATISTICS, 2022, (37):
  • [42] Formalizing Data Mining with Association Rules
    Rauch, Jan
    2012 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC 2012), 2012, : 406 - 411
  • [43] Study on the association rules of data mining
    Li, YR
    ISTM/2005: 6th International Symposium on Test and Measurement, Vols 1-9, Conference Proceedings, 2005, : 459 - 462
  • [44] Data mining in law with association rules
    Stranieri, A
    Zeleznikow, J
    Turner, H
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON LAW AND TECHNOLOGY, 2000, : 129 - 134
  • [45] Association rules mining of image data
    Shu, Feng-Di
    Wu, Guo-Qing
    Wang, Min
    Xiaoxing Weixing Jisuanji Xitong/Mini-Micro Systems, 2001, 22 (11):
  • [46] Data mining for ranged association rules
    Lee, DP
    Yang, SP
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL I AND II, 1999, : 32 - 37
  • [47] On data partitions for mining association rules
    Han, JL
    INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 1176 - 1182
  • [48] Pruning association rules in data mining
    Qin, Min
    Li, Zhi-Zhu
    2001, Shanghai Jiao Tong University (35):
  • [49] Agent-Based Data Mining Framework for the High-Dimensional Environment
    李侃
    刘玉树
    Journal of Beijing Institute of Technology, 2005, (02) : 113 - 116
  • [50] Mining the structural knowledge of high-dimensional medical data using Isomap
    Weng, S
    Zhang, C
    Lin, Z
    Zhang, X
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2005, 43 (03) : 410 - 412