MRQAR: A generic MapReduce framework to discover quantitative association rules in big data problems

Cited by: 30
Authors
Martin, D. [1]
Martinez-Ballesteros, M. [2]
Garcia-Gil, D. [3]
Alcala-Fdez, J. [3]
Herrera, F. [3, 4]
Riquelme-Santos, J. C. [2]
Affiliations
[1] Technol Univ Havana JA Echeverria, Dept Artificial Intelligence & Infrastruct Inform, Havana, Cuba
[2] Univ Seville, Dept Comp Sci, Seville, Spain
[3] Univ Granada, Comp Sci & Artificial Intelligence, Granada, Spain
[4] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah, Saudi Arabia
Keywords
Quantitative association rules; Multiobjective evolutionary algorithms; Big Data; MapReduce; Spark; GENETIC ALGORITHM; MINE;
DOI
10.1016/j.knosys.2018.04.037
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Many algorithms have emerged in recent years to address the discovery of quantitative association rules from datasets. However, this task is becoming a challenge because most existing techniques lack the processing power to handle the large amounts of data generated nowadays, commonly known as Big Data. A number of previous studies have focused on mining boolean or nominal association rules from Big Data problems. Nevertheless, the data in real-world applications usually consist of quantitative values, and designing data mining algorithms able to extract quantitative association rules presents a challenge to researchers in this field. Although classical methods to discover boolean or nominal association rules are available in the most well-known repositories of Big Data algorithms, such repositories do not provide methods to discover quantitative association rules. Indeed, no methodologies that work without prior discretization have been proposed in the literature for Big Data. Hence, this work proposes MRQAR, a new generic parallel framework to discover quantitative association rules in large amounts of data, designed following the MapReduce paradigm using Apache Spark. MRQAR performs incremental learning and can run any sequential quantitative association rule algorithm on Big Data problems without the need to redesign it. As a case study, we have integrated the multiobjective evolutionary algorithm MOPNAR into MRQAR to validate the generic MapReduce framework proposed in this work. The results of the experimental study, performed on five Big Data problems, demonstrate the capability of MRQAR to obtain a reduced set of high-quality rules in reasonable time.
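To make the terms in the abstract concrete, the following is a minimal sketch of how the support and confidence of one quantitative association rule (attributes constrained to numeric intervals, with no prior discretization) could be computed in a map/reduce style over data partitions. It is an illustration only and not the MRQAR algorithm or its Spark implementation: the function names, the record layout, and the example rule are all assumptions introduced here, and plain Python lists stand in for distributed partitions.

```python
from functools import reduce

def covers(intervals, record):
    """True if every attribute in `intervals` lies inside its [low, high] range."""
    return all(low <= record[attr] <= high
               for attr, (low, high) in intervals.items())

def map_partition(rule, partition):
    """Map step (hypothetical helper): local counts for one partition."""
    antecedent, consequent = rule
    n_ant = sum(1 for r in partition if covers(antecedent, r))
    n_both = sum(1 for r in partition
                 if covers(antecedent, r) and covers(consequent, r))
    return (len(partition), n_ant, n_both)

def reduce_counts(a, b):
    """Reduce step: add the per-partition counts element-wise."""
    return tuple(x + y for x, y in zip(a, b))

def evaluate(rule, partitions):
    """Return (support, confidence) of `rule` over all partitions."""
    total, n_ant, n_both = reduce(
        reduce_counts,
        (map_partition(rule, p) for p in partitions),
        (0, 0, 0),
    )
    support = n_both / total if total else 0.0
    confidence = n_both / n_ant if n_ant else 0.0
    return support, confidence

# Illustrative data, invented for this sketch: two partitions of records.
partitions = [
    [{"age": 25, "salary": 30}, {"age": 35, "salary": 55}],
    [{"age": 32, "salary": 60}, {"age": 38, "salary": 40}],
]
# Hypothetical rule: age in [30, 40] -> salary in [50, 70]
rule = ({"age": (30, 40)}, {"salary": (50, 70)})
support, confidence = evaluate(rule, partitions)
```

Because each partition only contributes three integer counts, the reduce step is associative and commutative, which is the property a MapReduce engine such as Spark relies on when combining partial results.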
Pages: 176-192
Number of pages: 17