MRQAR: A generic MapReduce framework to discover quantitative association rules in big data problems

Cited by: 30
Authors
Martin, D. [1]
Martinez-Ballesteros, M. [2]
Garcia-Gil, D. [3]
Alcala-Fdez, J. [3]
Herrera, F. [3, 4]
Riquelme-Santos, J. C. [2]
Affiliations
[1] Technol Univ Havana JA Echeverria, Dept Artificial Intelligence & Infrastruct Inform, Havana, Cuba
[2] Univ Seville, Dept Comp Sci, Seville, Spain
[3] Univ Granada, Comp Sci & Artificial Intelligence, Granada, Spain
[4] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah, Saudi Arabia
Keywords
Quantitative association rules; Multiobjective evolutionary algorithms; Big Data; MapReduce; Spark; GENETIC ALGORITHM; MINE
DOI
10.1016/j.knosys.2018.04.037
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Many algorithms have emerged in recent years to discover quantitative association rules from datasets. However, this task is becoming a challenge because most existing techniques lack the processing power to handle the large amounts of data generated nowadays, known as Big Data. A number of previous studies have focused on mining boolean or nominal association rules from Big Data problems. Nevertheless, the data in real-world applications usually consist of quantitative values, and designing data mining algorithms able to extract quantitative association rules presents a challenge to researchers in this field. Although classical methods to discover boolean or nominal association rules can be found in the most well-known repositories of Big Data algorithms, such repositories do not provide methods to discover quantitative association rules. Indeed, no methodologies have been proposed in the literature that mine such rules in Big Data without prior discretization. Hence, this work proposes MRQAR, a new generic parallel framework to discover quantitative association rules in large amounts of data, designed following the MapReduce paradigm using Apache Spark. MRQAR performs incremental learning and is able to run any sequential quantitative association rule algorithm in Big Data problems without needing to redesign such algorithms. As a case study, we have integrated the multiobjective evolutionary algorithm MOPNAR into MRQAR to validate the generic MapReduce framework proposed in this work. The results of the experimental study, performed on five Big Data problems, demonstrate the capability of MRQAR to obtain a reduced set of high-quality rules in reasonable time.
Pages: 176-192
Page count: 17