Metaheuristic Based Clustering with Deep Learning Model for Big Data Classification

被引:3
|
作者
Krishnaswamy, R. [1 ]
Subramaniam, Kamalraj [2 ]
Nandini, V [3 ]
Vijayalakshmi, K. [4 ]
Kadry, Seifedine [5 ]
Nam, Yunyoung [6 ]
机构
[1] Univ Coll Engn Ariyalur, Dept Elect & Commun Engn, Ariyalur 621704, India
[2] Karpagam Acad Higher Educ, Fac Engn, Dept Biomed Engn, Coimbatore 641021, Tamil Nadu, India
[3] Sona Coll Technol, Dept Comp Sci & Engn, Salem 636005, India
[4] Saveetha Inst Med & Tech Sci, Saveetha Sch Engn, Dept Elect & Commun Engn, Chennai 600077, Tamil Nadu, India
[5] Noroff Univ Coll, Dept Appl Data Sci, Kristiansand, Norway
[6] Soonchunhyang Univ, Dept Comp Sci & Engn, Asan, South Korea
来源
关键词
Big data; data classification; clustering; mapreduce; dbscan algorithm;
D O I
10.32604/csse.2023.024901
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, a massive quantity of data is being produced from a distinct number of sources and the size of the daily created on the Internet has crossed two Exabytes. At the same time, clustering is one of the efficient techniques for mining big data to extract the useful and hidden patterns that exist in it. Density-based clustering techniques have gained significant attention owing to the fact that it helps to effectively recognize complex patterns in spatial dataset. Big data clustering is a trivial process owing to the increasing quantity of data which can be solved by the use of Map Reduce tool. With this motivation, this paper presents an efficient Map Reduce based hybrid density based clustering and classification algorithm for big data analytics (MR-HDBCC). The proposed MR-HDBCC technique is executed on Map Reduce tool for handling the big data. In addition, the MR-HDBCC technique involves three distinct processes namely pre-processing, clustering, and classification. The proposed model utilizes the Density-Based Spatial Clustering of Applications with Noise (DBSCAN) technique which is capable of detecting random shapes and diverse clusters with noisy data. For improving the performance of the DBSCAN technique, a hybrid model using cockroach swarm optimization (CSO) algorithm is developed for the exploration of the search space and determine the optimal parameters for density based clustering. Finally, bidirectional gated recurrent neural network (BGRNN) is employed for the classification of big data. The experimental validation of the proposed MR-HDBCC technique takes place using the benchmark dataset and the simulation outcomes demonstrate the promising performance of the proposed model interms of different measures.
引用
收藏
页码:391 / 406
页数:16
相关论文
共 50 条
  • [1] Big Data and Deep Learning-Based Video Classification Model for Sports
    Wang, Lin
    Zhang, Haiyan
    Yuan, Guoliang
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [2] Big Data Image Classification Based on Distributed Deep Representation Learning Model
    Zhu, Minjun
    Chen, Qinghua
    [J]. IEEE Access, 2020, 8 : 133890 - 133904
  • [3] Big Data Image Classification Based on Distributed Deep Representation Learning Model
    Zhu, Minjun
    Chen, Qinghua
    [J]. IEEE ACCESS, 2020, 8 : 133890 - 133904
  • [4] Image Classification Based on Deep Learning for Big Data of Power Grid
    Yin, Jun
    Zhu, Yongxin
    Shi, Weiwei
    Qiu, Yunru
    Liu, Xingying
    Sheng, Gehao
    [J]. PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND AUTOMATIC CONTROL, 2016, 367 : 1233 - 1241
  • [5] Big Data Analytics with Optimal Deep Learning Model for Medical Image Classification
    Alqahtani, Tariq Mohammed
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 44 (02): : 1433 - 1449
  • [6] Deep learning based sentiment classification on user-generated big data
    Kumar A.
    Jaiswal A.
    [J]. Jaiswal, Arunima (arunimajaiswal@gmail.com), 1600, Bentham Science Publishers (13): : 1047 - 1056
  • [7] Grid Clustering Analysis the Big Data of Spectrum by Deep Learning
    Chen Shuxin
    Sun Weimin
    [J]. 2017 2ND INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2017), 2017, : 1002 - 1005
  • [8] A Clustering Based Anonymization Model for Big Data
    Canbay, Yavuz
    Kalyoncu, Aydincan
    Ercimen, Mucahid
    Dogan, Adem
    Sagiroglu, Seref
    [J]. 2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 720 - 725
  • [9] Environment Classification for Lrban Big Data Using Deep Learning
    Hossain, M. Shamim
    Muhammad, Ghulam
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (11) : 44 - 50
  • [10] Classification and unsupervised clustering of LIGO data with Deep Transfer Learning
    George, Daniel
    Shen, Hongyu
    Huerta, E. A.
    [J]. PHYSICAL REVIEW D, 2018, 97 (10)