Data Mining Techniques for Producing Clustering in Big Data with MapReduce Function

被引:0
|
作者
Presskila, X. Arogya [1 ]
Robinson, Y. Harold [2 ]
机构
[1] Department of Computer Science and Engineering, SCAD College of Engineering and Technology, Tirunelveli, India
[2] School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, India
来源
Studies in Big Data | 2021年 / 93卷
关键词
Business growth - Clusterings - Data-mining techniques - Google search engine - Heterogeneous sources - Large volumes - Map-reduce - Petabytes;
D O I
暂无
中图分类号
学科分类号
摘要
Big data is a large collection of dataset from heterogeneous sources of data which may be terabytes or petabytes of data. The big data is useful for existing business growth and also supports to create the new business. Handling this much of data is very difficult in database management system. The problems of big data are storing, processing, analyzing, extracting, and privacy. This survey paper, mainly focused on challenges of big data, how to extract the required data from large volume of data, and also various clustering algorithm. For the extraction of data, mapreduce function is used which is mainly used in Google search engine. © Springer Science and Business Media Deutschland GmbH. All rights reserved.
引用
收藏
页码:195 / 203
相关论文
共 50 条
  • [41] Big data mining with parallel computing: A comparison of distributed and MapReduce methodologies
    Tsai, Chih-Fong
    Lin, Wei-Chao
    Ke, Shih-Wen
    JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 122 : 83 - 92
  • [42] Suggested Techniques for Clustering and Mining of Data Streams
    Anuradha, G.
    Roy, Bidisha
    2014 INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, COMMUNICATION AND INFORMATION TECHNOLOGY APPLICATIONS (CSCITA), 2014, : 265 - 270
  • [43] Mining multidimensional data using clustering techniques
    Pagani, Marco
    Bordogna, Gloria
    Valle, Massimiliano
    DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 382 - +
  • [44] Challenges for MapReduce in Big Data
    Grolinger, Katarina
    Hayes, Michael
    Higashino, Wilson A.
    L'Heureux, Alexandra
    Allison, David S.
    Capretz, Miriam A. M.
    2014 IEEE WORLD CONGRESS ON SERVICES (SERVICES), 2014, : 182 - 189
  • [45] MapReduce: Simplified Data Analysis of Big Data
    Maitrey, Seema
    Jha, C. K.
    3RD INTERNATIONAL CONFERENCE ON RECENT TRENDS IN COMPUTING 2015 (ICRTC-2015), 2015, 57 : 563 - 571
  • [46] Analysis of agriculture data using data mining techniques: application of big data
    Majumdar J.
    Naraseeyappa S.
    Ankalaki S.
    Journal of Big Data, 4 (1)
  • [47] A Comprehensive Study on Clustering Approaches for Big Data Mining
    Pandove, Divya
    Goel, Shivani
    2015 2ND INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS), 2015, : 1333 - 1338
  • [48] A survey of Big Data in social media using data mining techniques
    Gole, Sheela
    Tidke, Bharat
    ICACCS 2015 PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS, 2015,
  • [49] Big data analytics in health care by data mining and classification techniques
    Jayasri, N. P.
    Aruna, R.
    ICT EXPRESS, 2022, 8 (02): : 250 - 257
  • [50] Big data pre-processing methods with vehicle driving data using MapReduce techniques
    Wonhee Cho
    Eunmi Choi
    The Journal of Supercomputing, 2017, 73 : 3179 - 3195