STARM: STreaming Association Rules Mining in High-Dimensional Data

被引:0
|
作者
Gahar, Rania Mkhinini [1 ]
Arfaoui, Olfa [2 ]
Hidri, Adel [3 ]
Alsaif, Suleiman Ali [3 ]
Hidri, Minyar Sassi [3 ]
机构
[1] Univ Tunis El Manar, Natl Engn Sch Tunis, OASIS Res Lab, Tunis, Tunisia
[2] Univ Tunis El Manar, Natl Engn Sch Tunis, RISC Res Lab, Tunis, Tunisia
[3] Imam Abdulrahman Bin Faisal Univ, Dept Comp, Deanship Preparatory Year & Supporting Studies, Dammam, Saudi Arabia
关键词
Association Rules; Dimensionality Reduction; Spark Streaming; Apriori; Sliding Window;
D O I
10.1007/978-3-031-57853-3_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predictive analytics involves using Data Mining algorithms to discover knowledge from large databases. The Association Rules (ARs) mining technique is considered to be one of the most prevalent data mining techniques in this context. When it comes to Big Data, we talk about data stream mining which is the process of extracting knowledge from continuous data streams. In this paper, STARM (STreaming Association Rules Mining) is proposed as an efficient and distributed algorithm for mining ARs. Based on the transaction-sensitive sliding-window model, the Apriori algorithm is applied to data streams to extract frequent itemsets (FI) that are then generated into ARs via Spark streaming framework. A Dimensionality Reduction (DR) step takes place as a data preprocessing step that may reduce the search space. The conducted experiments show that the proposed streaming model achieves state-of-the-art performance.
引用
收藏
页码:136 / 146
页数:11
相关论文
共 50 条
  • [1] The Improved Research of Association Rules Mining Algorithm in High-Dimensional Big Data
    Du, Lingling
    NEW INDUSTRIALIZATION AND URBANIZATION DEVELOPMENT ANNUAL CONFERENCE: THE INTERNATIONAL FORUM ON NEW INDUSTRIALIZATION DEVELOPMENT IN BIG-DATA ERA, 2015, : 239 - 244
  • [2] Visualization and data mining of high-dimensional data
    Inselberg, A
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2002, 60 (1-2) : 147 - 159
  • [3] Deep mining method for high-dimensional big data based on association rule
    Xu, Shu
    INTERNATIONAL JOURNAL OF INTERNET PROTOCOL TECHNOLOGY, 2021, 14 (03) : 147 - 154
  • [4] Data Mining for High-Dimensional Measurement Systems
    Mikut, Ralf
    TM-TECHNISCHES MESSEN, 2010, 77 (10) : 524 - 529
  • [5] Efficient Association Rules Mining from Streaming Data with a Fault Tolerance
    Abd Elaty, Amr Aly
    Salem, Rashed
    Abd Elkader, Hatem
    PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 627 - 632
  • [6] Association rule mining algorithms on high-dimensional datasets
    Ai, Dongmei
    Pan, Hongfei
    Li, Xiaoxin
    Gao, Yingxin
    He, Di
    ARTIFICIAL LIFE AND ROBOTICS, 2018, 23 (03) : 420 - 427
  • [7] An efficient clustering method of data mining for high-dimensional data
    Chang, JW
    Kang, HM
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTING TECHNIQUES, 2004, : 273 - 278
  • [8] High-dimensional clustering method for high performance data mining
    Chang, Jae-Woo
    Lee, Hyun-Jo
    COMPUTATIONAL SCIENCE - ICCS 2007, PT 3, PROCEEDINGS, 2007, 4489 : 621 - +
  • [9] Sequential Patterns Mining on High-Dimensional Data Stream
    Huang, Qin-Hua
    INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, ICIC 2012, 2012, 7390 : 447 - 454
  • [10] High-dimensional Density Estimation for Data Mining Tasks
    Kuleshov, Alexander
    Bernstein, Alexander
    Yanovich, Yury
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2017), 2017, : 523 - 530