Fuzzy Neighbors and Deep Learning-Assisted Spark Model for Imbalanced Classification of Big Data

被引:0
|
作者
Nalinipriya, G. [1 ]
Geetha, M. [2 ]
Sudha, D. [3 ]
Daniya, T. [4 ]
机构
[1] Saveetha Engn Coll, Dept Informat Technol, Chennai 602105, Tamil Nadu, India
[2] Chennai Inst Technol, Dept Comp Sci & Engn, Chennai, Tamil Nadu, India
[3] Meenakshi Coll Engn, Dept Comp Sci & Engn, Chennai 600078, Tamil Nadu, India
[4] GMR Inst Technol, Dept Informat Technol, Rajam 532127, Andhra Prades, India
关键词
Big data classification; spark architecture; Bird Swarm optimization; Deep Belief network; Deer hunting optimization; MAPREDUCE; FRAMEWORK;
D O I
10.1142/S0218488523500095
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Big data is important in knowledge manipulation, assessment, and prediction. However, extracting and analyzing knowledge through big database are complex because of imbalance data distribution that leads to wrong decisions and biased classification outputs. Hence, an effective and optimal big data classification approach is designed using the proposed Bird Swarm Deer Hunting Optimization-Deep Belief Network (BSDHO-based DBN) algorithm based on spark architecture that follows the master and slave nodes. The proposed BSDHO is obtained by combining Deer Hunting Optimization algorithm and Bird Swarm Algorithm. The developed model poses two nodes, namely slave and master node. The training data is initially given to the master node in the spark architecture to perform transformation of data. Here, the transformation of data is done with an exponential log kernel, and then selection of feature is done with sequential forward selecting for choosing suitable features for enhanced processing. Consequently, oversampling process is performed with Fuzzy K-Nearest Neighbor (Fuzzy KNN) in the slave node using selected features to manage imbalance data. Then, in master node, classification is done with Deep belief Network, and trained using developed Bird swarm Deer Hunting Optimization (BSDHO) algorithm. On the other hand, the test data is taken as input, and is fed to the slave node to perform data transformation. Then, the transformed data is given to the master node for classification based on the proposed BSDHO. At last, the training data and testing data output produced the classified output. The proposed BSDHO-based DBN provided enhanced outcomes with highest specificity of 97.92%, accuracy of 96.92%, and sensitivity of 96.9%.
引用
收藏
页码:141 / 162
页数:22
相关论文
共 50 条
  • [41] Learning Deep Landmarks for Imbalanced Classification
    Bao, Feng
    Deng, Yue
    Kong, Youyong
    Ren, Zhiquan
    Suo, Jinli
    Dai, Qionghai
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (08) : 2691 - 2704
  • [42] Deep reinforcement learning for imbalanced classification
    Enlu Lin
    Qiong Chen
    Xiaoming Qi
    Applied Intelligence, 2020, 50 : 2488 - 2502
  • [43] Evolutionary Undersampling for Imbalanced Big Data Classification
    Triguero, I.
    Galar, M.
    Vluymans, S.
    Cornelis, C.
    Bustince, H.
    Herrera, F.
    Saeys, Y.
    2015 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2015, : 715 - 722
  • [44] HSDLM: A Hybrid Sampling With Deep Learning Method for Imbalanced Data Classification
    Hasib, Khan Md
    Towhid, Nurul Akter
    Islam, Md Rafiqul
    INTERNATIONAL JOURNAL OF CLOUD APPLICATIONS AND COMPUTING, 2021, 11 (04) : 1 - 13
  • [45] Spark Based Distributed Deep Learning Framework For Big Data Applications
    Khumoyun, Akhmedov
    Cui, Yun
    Hanku, Lee
    2016 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND COMMUNICATIONS TECHNOLOGIES (ICISCT), 2016,
  • [46] Mobile Big Data Analytics Using Deep Learning and Apache Spark
    Abu Alsheikh, Mohammad
    Niyato, Dusit
    Lin, Shaowei
    Tan, Hwee-Pink
    Han, Zhu
    IEEE NETWORK, 2016, 30 (03): : 22 - 29
  • [47] Deep Learning-Assisted Diagnosis of Cerebral Aneurysms Using the HeadXNet Model
    Park, Allison
    Chute, Chris
    Rajpurkar, Pranav
    Lou, Joe
    Ball, Robyn L.
    Shpanskaya, Katie
    Jabarkheel, Rashad
    Kim, Lily H.
    McKenna, Emily
    Tseng, Joe
    Ni, Jason
    Wishah, Fidaa
    Wittber, Fred
    Hong, David S.
    Wilson, Thomas J.
    Halabi, Safwan
    Basu, Sanjay
    Patel, Bhavik N.
    Lungren, Matthew P.
    Ng, Andrew Y.
    Yeom, Kristen W.
    JAMA NETWORK OPEN, 2019, 2 (06) : e195600
  • [48] A Big Data Analysis Framework Using Apache Spark and Deep Learning
    Gupta, Anand
    Thakur, Hardeo Kumar
    Shrivastava, Ritvik
    Kumar, Pulkit
    Nag, Sreyashi
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2017), 2017, : 9 - 16
  • [50] A Data Augmentation-Assisted Deep Learning Model for High Dimensional and Highly Imbalanced Hyperspectral Imaging Data
    Rochac, Juan F. Ramirez
    Zhang, Nian
    Thompson, Lara
    Oladunni, Timothy
    2019 9TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST2019), 2019, : 362 - 367