A MapReduce-based Fuzzy Associative Classifier for Big Data

Cited by: 0
Authors
Ducange, Pietro [1]
Marcelloni, Francesco [2]
Segatori, Armando [2]
Affiliations
[1] ECampus Univ, Fac Ingn, I-22060 Novedrate, Italy
[2] Univ Pisa, Dip Ingn Informaz, I-56122 Pisa, Italy
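Keywords
DOI
Not available
Chinese Library Classification number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809
Abstract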
In this paper, we propose an efficient distributed fuzzy associative classification model based on the MapReduce paradigm. The learning algorithm first mines a set of fuzzy association classification rules by employing a distributed version of a fuzzy extension of the well-known FP-Growth algorithm. Then, it prunes this set by using three purposely adapted types of pruning. We implemented the distributed fuzzy associative classifier using the Hadoop framework. We show the scalability of our approach by carrying out a number of experiments on a real-world big dataset. In particular, we evaluate the achievable speedup on a small computer cluster, highlighting that the proposed approach allows handling big datasets even with modest hardware support.
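As a rough illustration of the kind of map and reduce steps a distributed fuzzy frequent-pattern miner might build on, the following sketch computes the fuzzy support of single (feature, fuzzy set, class) items in plain Python. It is not the authors' algorithm or their Hadoop code: the triangular membership functions, the three-set partition of [0, 1], and the names `triangular`, `FUZZY_SETS`, `map_record`, `reduce_counts`, and `fuzzy_supports` are all assumptions made for illustration, and the FP-Growth tree construction and the three pruning steps are not shown.

```python
from collections import defaultdict


def triangular(x, a, b, c):
    """Membership degree of x in a triangular fuzzy set with feet a, c and peak b."""
    if x == b:
        return 1.0
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x < b else (c - x) / (c - b)


# Hypothetical uniform partition of a feature normalized to [0, 1].
FUZZY_SETS = {
    "low": (-0.5, 0.0, 0.5),
    "medium": (0.0, 0.5, 1.0),
    "high": (0.5, 1.0, 1.5),
}


def map_record(record):
    """Map step (assumed): emit ((feature, fuzzy set, class), degree) pairs."""
    features, label = record
    for idx, value in enumerate(features):
        for name, (a, b, c) in FUZZY_SETS.items():
            degree = triangular(value, a, b, c)
            if degree > 0.0:
                yield (idx, name, label), degree


def reduce_counts(pairs):
    """Reduce step (assumed): sum the membership degrees emitted for each key."""
    totals = defaultdict(float)
    for key, degree in pairs:
        totals[key] += degree
    return dict(totals)


def fuzzy_supports(records):
    """Fuzzy support of each item: summed membership divided by the record count."""
    pairs = (pair for record in records for pair in map_record(record))
    totals = reduce_counts(pairs)
    return {key: total / len(records) for key, total in totals.items()}


if __name__ == "__main__":
    # Toy data: two normalized features and a class label per record.
    data = [((0.0, 1.0), "A"), ((0.5, 0.75), "A"), ((1.0, 0.25), "B")]
    for key, support in sorted(fuzzy_supports(data).items()):
        print(key, round(support, 3))
```

In a real MapReduce job the map function would run on separate data splits and the framework would group pairs by key before the reduce step. Here both steps run in a single process only to keep the sketch self-contained.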
Pages: 8