STARM: STreaming Association Rules Mining in High-Dimensional Data

被引:0
|
作者
Gahar, Rania Mkhinini [1 ]
Arfaoui, Olfa [2 ]
Hidri, Adel [3 ]
Alsaif, Suleiman Ali [3 ]
Hidri, Minyar Sassi [3 ]
机构
[1] Univ Tunis El Manar, Natl Engn Sch Tunis, OASIS Res Lab, Tunis, Tunisia
[2] Univ Tunis El Manar, Natl Engn Sch Tunis, RISC Res Lab, Tunis, Tunisia
[3] Imam Abdulrahman Bin Faisal Univ, Dept Comp, Deanship Preparatory Year & Supporting Studies, Dammam, Saudi Arabia
关键词
Association Rules; Dimensionality Reduction; Spark Streaming; Apriori; Sliding Window;
D O I
10.1007/978-3-031-57853-3_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predictive analytics involves using Data Mining algorithms to discover knowledge from large databases. The Association Rules (ARs) mining technique is considered to be one of the most prevalent data mining techniques in this context. When it comes to Big Data, we talk about data stream mining which is the process of extracting knowledge from continuous data streams. In this paper, STARM (STreaming Association Rules Mining) is proposed as an efficient and distributed algorithm for mining ARs. Based on the transaction-sensitive sliding-window model, the Apriori algorithm is applied to data streams to extract frequent itemsets (FI) that are then generated into ARs via Spark streaming framework. A Dimensionality Reduction (DR) step takes place as a data preprocessing step that may reduce the search space. The conducted experiments show that the proposed streaming model achieves state-of-the-art performance.
引用
收藏
页码:136 / 146
页数:11
相关论文
共 50 条
  • [21] Clustering High-Dimensional Stock Data using Data Mining Approach
    Indriyanti, Dhea
    Dhini, Arian
    2019 16TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT (ICSSSM2019), 2019,
  • [22] Online inference in high-dimensional generalized linear models with streaming data
    Luo, Lan
    Han, Ruijian
    Lin, Yuanyuan
    Huang, Jian
    ELECTRONIC JOURNAL OF STATISTICS, 2023, 17 (02): : 3443 - 3471
  • [23] Online sparse sliced inverse regression for high-dimensional streaming data
    Xu, Jianjun
    Cui, Wenquan
    Cheng, Haoyang
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2023, 21 (02)
  • [24] Event correlation and forecasting over high-dimensional streaming sensor data
    Papataxiarhis, Vassilis
    Hadjiefthymiades, Stathes
    2018 14TH INTERNATIONAL CONFERENCE ON WIRELESS AND MOBILE COMPUTING, NETWORKING AND COMMUNICATIONS (WIMOB 2018), 2018,
  • [25] HD-eye: visual mining of high-dimensional data
    Hinneburg, Alexander
    Keim, Daniel A.
    Wawryniuk, Markus
    IEEE Computer Graphics and Applications, 19 (05): : 22 - 31
  • [26] HD-eye: Visual mining of high-dimensional data
    Hinneburg, A
    Keim, DA
    Wawryniuk, M
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 1999, 19 (05) : 22 - 31
  • [27] An Optimal Big Data Analytics with Concept Drift Detection on High-Dimensional Streaming Data
    Mansour, Romany F.
    Al-Otaibi, Shaha
    Al-Rasheed, Amal
    Aljuaid, Hanan
    Pustokhina, Irina, V
    Pustokhin, Denis A.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (03): : 2843 - 2858
  • [28] Using data cube for mining of hybrid-dimensional association rules
    Li, ZJ
    Huang, FX
    Zhou, DQ
    Zhang, P
    GRID AND COOPERATIVE COMPUTING, PT 2, 2004, 3033 : 899 - 902
  • [29] A Local Discrete Text Data Mining Method in High-Dimensional Data Space
    Juan Li
    Aiping Chen
    International Journal of Computational Intelligence Systems, 15
  • [30] A Local Discrete Text Data Mining Method in High-Dimensional Data Space
    Li, Juan
    Chen, Aiping
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2022, 15 (01)