Data Mining Techniques for Producing Clustering in Big Data with MapReduce Function

被引:0
|
作者
Presskila, X. Arogya [1 ]
Robinson, Y. Harold [2 ]
机构
[1] Department of Computer Science and Engineering, SCAD College of Engineering and Technology, Tirunelveli, India
[2] School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, India
来源
Studies in Big Data | 2021年 / 93卷
关键词
Business growth - Clusterings - Data-mining techniques - Google search engine - Heterogeneous sources - Large volumes - Map-reduce - Petabytes;
D O I
暂无
中图分类号
学科分类号
摘要
Big data is a large collection of dataset from heterogeneous sources of data which may be terabytes or petabytes of data. The big data is useful for existing business growth and also supports to create the new business. Handling this much of data is very difficult in database management system. The problems of big data are storing, processing, analyzing, extracting, and privacy. This survey paper, mainly focused on challenges of big data, how to extract the required data from large volume of data, and also various clustering algorithm. For the extraction of data, mapreduce function is used which is mainly used in Google search engine. © Springer Science and Business Media Deutschland GmbH. All rights reserved.
引用
收藏
页码:195 / 203
相关论文
共 50 条
  • [21] An Efficient Clustering Technique for Big Data Mining
    Banait, Satish S.
    Sane, S. S.
    Talekar, Sopan A.
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2022, 13 (03): : 702 - 717
  • [22] MapReduce-based parallel GEP algorithm for efficient function mining in big data applications
    Liu, Yang
    Ma, Chenxiao
    Xu, Lixiong
    Shen, Xiaodong
    Li, Maozhen
    Li, Pengcheng
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (23):
  • [23] Big Data Analytics Using Data Mining Techniques: A Survey
    Mittal, Shweta
    Sangwan, Om Prakash
    ADVANCED INFORMATICS FOR COMPUTING RESEARCH, ICAICR 2018, PT I, 2019, 955 : 264 - 273
  • [24] Data Mining and Fusion Techniques for WSNs as a Source of the Big Data
    Fouad, Mohamed Mostafa
    Oweis, Nour E.
    Gaber, Tarek
    Ahmed, Maamoun
    Snasel, Vaclav
    INTERNATIONAL CONFERENCE ON COMMUNICATIONS, MANAGEMENT, AND INFORMATION TECHNOLOGY (ICCMIT'2015), 2015, 65 : 778 - 786
  • [25] A Survey of Clustering Techniques for Big Data Analysis
    Arora, Saurabh
    Chana, Inderveer
    2014 5TH INTERNATIONAL CONFERENCE CONFLUENCE THE NEXT GENERATION INFORMATION TECHNOLOGY SUMMIT (CONFLUENCE), 2014, : 59 - 65
  • [26] EMR: Scalable Clustering of Big HR Data using Evolutionary MapReduce
    Bohlouli, Mahdi
    He, Zhonghua
    WEB CONFERENCE 2021: COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2021), 2021, : 26 - 34
  • [27] Optimized big data K-means clustering using MapReduce
    Cui, Xiaoli
    Zhu, Pingfei
    Yang, Xin
    Li, Keqiu
    Ji, Changqing
    JOURNAL OF SUPERCOMPUTING, 2014, 70 (03): : 1249 - 1259
  • [28] Optimized big data K-means clustering using MapReduce
    Xiaoli Cui
    Pingfei Zhu
    Xin Yang
    Keqiu Li
    Changqing Ji
    The Journal of Supercomputing, 2014, 70 : 1249 - 1259
  • [29] Hierarchical PSO Clustering on MapReduce for Scalable Privacy Preservation in Big Data
    Wai, Ei Nyein Chan
    Tsai, Pei-Wei
    Pan, Jeng-Shyang
    GENETIC AND EVOLUTIONARY COMPUTING, 2017, 536 : 36 - 44
  • [30] Efficient MapReduce Kernel k-Means for Big Data Clustering
    Tsapanos, Nikolaos
    Tefas, Anastasios
    Nikolaidis, Nikolaos
    Pitas, Ioannis
    9TH HELLENIC CONFERENCE ON ARTIFICIAL INTELLIGENCE (SETN 2016), 2016,