Fuzzy rule based classification systems for big data with MapReduce: granularity analysis

被引:34
|
作者
Fernandez, Alberto [1 ]
del Rio, Sara [1 ]
Bawakid, Abdullah [2 ]
Herrera, Francisco [1 ,2 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada, Spain
[2] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah, Saudi Arabia
关键词
Big data; Fuzzy rule based classification systems; Granularity; MapReduce; Hadoop; DATA SCIENCE; CHALLENGES; PROPOSAL;
D O I
10.1007/s11634-016-0260-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Due to the vast amount of information available nowadays, and the advantages related to the processing of this data, the topics of big data and data science have acquired a great importance in the current research. Big data applications are mainly about scalability, which can be achieved via the MapReduce programming model.It is designed to divide the data into several chunks or groups that are processed in parallel, and whose result is "assembled" to provide a single solution. Among different classification paradigms adapted to this new framework, fuzzy rule based classification systems have shown interesting results with a MapReduce approach for big data. It is well known that the performance of these types of systems has a strong dependence on the selection of a good granularity level for the Data Base. However, in the context of MapReduce this parameter is even harder to determine as it can be also related with the number of Maps chosen for the processing stage. In this paper, we aim at analyzing the interrelation between the number of labels of the fuzzy variables and the scarcity of the data due to the data sampling in MapReduce. Specifically, we consider that as the partitioning of the initial instance set grows, the level of granularity necessary to achieve a good performance also becomes higher. The experimental results, carried out for several Big Data problems, and using the Chi-FRBCS-BigData algorithms, support our claims.
引用
收藏
页码:711 / 730
页数:20
相关论文
共 50 条
  • [1] Fuzzy rule based classification systems for big data with MapReduce: granularity analysis
    Alberto Fernández
    Sara del Río
    Abdullah Bawakid
    Francisco Herrera
    [J]. Advances in Data Analysis and Classification, 2017, 11 : 711 - 730
  • [2] On the use of MapReduce to build Linguistic Fuzzy Rule Based Classification Systems for Big Data
    Lopez, Victoria
    del Rio, Sara
    Manuel Benitez, Jose
    Herrera, Francisco
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2014, : 1905 - 1912
  • [3] Cost-sensitive linguistic fuzzy rule based classification systems under the MapReduce framework for imbalanced big data
    Lopez, Victoria
    del Rio, Sara
    Manuel Benitez, Jose
    Herrera, Francisco
    [J]. FUZZY SETS AND SYSTEMS, 2015, 258 : 5 - 38
  • [4] A Mapreduce Fuzzy Techniques of Big Data Classification
    El Bakry, Malak
    Safwat, Soha
    Hegazy, Osman
    [J]. PROCEEDINGS OF THE 2016 SAI COMPUTING CONFERENCE (SAI), 2016, : 118 - 128
  • [5] Reasoning Methods in Fuzzy Rule-based Classification Systems for Big Data Problems
    Gonzalez, Antonio
    Perez, Raul
    Romero-Zaliz, Rocio
    [J]. PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY (IOTBDS 2019), 2019, : 255 - 261
  • [6] Summarizer: Fuzzy Rule-Based Classification Systems for Vertical and Horizontal Big Data
    Tuy, Petala G. da S. E.
    Rios, Tatiane Nogueira
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [7] Why Linguistic Fuzzy Rule Based Classification Systems perform well in Big Data Applications?
    Alberto Fernández
    Abdulrahman Altalhi
    Saleh Alshomrani
    Francisco Herrera
    [J]. International Journal of Computational Intelligence Systems, 2017, 10 : 1211 - 1225
  • [8] Improving Fuzzy Rule Based Classification Systems in Big Data via Support-based Filtering
    Iniguez, Luis
    Galar, Mikel
    Fernandez, Alberto
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2018,
  • [9] Why Linguistic Fuzzy Rule Based Classification Systems perform well in Big Data Applications?
    Fernandez, Alberto
    Altalhi, Abdulrahman
    Alshomrani, Saleh
    Herrera, Francisco
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2017, 10 (01) : 1211 - 1225
  • [10] Analysis of the Big Data based on MapReduce
    Tian, Zi-de
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 224 - 228