Data Mining Techniques for Producing Clustering in Big Data with MapReduce Function

被引:0
|
作者
Presskila, X. Arogya [1 ]
Robinson, Y. Harold [2 ]
机构
[1] Department of Computer Science and Engineering, SCAD College of Engineering and Technology, Tirunelveli, India
[2] School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, India
来源
Studies in Big Data | 2021年 / 93卷
关键词
Clustering algorithms - Information management - Large dataset - MapReduce - Search engines;
D O I
暂无
中图分类号
学科分类号
摘要
Big data is a large collection of dataset from heterogeneous sources of data which may be terabytes or petabytes of data. The big data is useful for existing business growth and also supports to create the new business. Handling this much of data is very difficult in database management system. The problems of big data are storing, processing, analyzing, extracting, and privacy. This survey paper, mainly focused on challenges of big data, how to extract the required data from large volume of data, and also various clustering algorithm. For the extraction of data, mapreduce function is used which is mainly used in Google search engine. © Springer Science and Business Media Deutschland GmbH. All rights reserved.
引用
收藏
页码:195 / 203
相关论文
共 50 条
  • [1] MapReduce Clustering for Big Data
    Ghattas, Badih
    Pinto, Antoine
    Diao, Sambou
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5116 - 5124
  • [2] Parallel Clustering Optimization Algorithm Based on MapReduce in Big Data Mining
    Zhang, Huajie
    Song, Lei
    Zhang, Sen
    [J]. IAENG International Journal of Applied Mathematics, 2023, 53 (01)
  • [3] The impact of big data market segmentation using data mining and clustering techniques
    Yoseph, Fahed
    Malim, Nurul Hashimah Ahamed Hassain
    Heikkila, Markku
    Brezulianu, Adrian
    Geman, Oana
    Rostam, Nur Aqilah Paskhal
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (05) : 6159 - 6173
  • [4] Clustering on Big Data Using Hadoop MapReduce
    Akthar, Nadeem
    Ahamad, Mohd Vasim
    Khan, Shahbaz
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 789 - 795
  • [5] A Mapreduce Fuzzy Techniques of Big Data Classification
    El Bakry, Malak
    Safwat, Soha
    Hegazy, Osman
    [J]. PROCEEDINGS OF THE 2016 SAI COMPUTING CONFERENCE (SAI), 2016, : 118 - 128
  • [6] A Paralleled Big Data Algorithm with MapReduce Framework for Mining Twitter Data
    Li Bing
    Chan, Keith C. C.
    [J]. 2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 121 - 128
  • [7] MapReduce based Method for Big Data Semantic Clustering
    Yang, Jie
    Li, Xiaoping
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 2814 - 2819
  • [8] Big data clustering with varied density based on MapReduce
    Safanaz Heidari
    Mahmood Alborzi
    Reza Radfar
    Mohammad Ali Afsharkazemi
    Ali Rajabzadeh Ghatari
    [J]. Journal of Big Data, 6
  • [9] Big data clustering with varied density based on MapReduce
    Heidari, Safanaz
    Alborzi, Mahmood
    Radfar, Reza
    Afsharkazemi, Mohammad Ali
    Ghatari, Ali Rajabzadeh
    [J]. JOURNAL OF BIG DATA, 2019, 6 (01)
  • [10] i2MapReduce: Incremental MapReduce for Mining Evolving Big Data
    Zhang, Yanfeng
    Chen, Shimin
    Wang, Qiang
    Yu, Ge
    [J]. 2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1482 - 1483