An Enhanced Memetic Algorithm for Feature Selection in Big Data Analytics with MapReduce

被引:3
|
作者
Ramakrishnan, Umanesan [1 ]
Nachimuthu, Nandhagopal [2 ]
机构
[1] Anna Univ, Chennai 600025, Tamil Nadu, India
[2] Excel Engn Coll, Dept Elect & Commun Engn, Namakkal 637303, India
来源
关键词
Big data analytics; metaheuristic; evolutionary algorithm; memetic optimization;
D O I
10.32604/iasc.2022.017123
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, various research fields have begun dealing with massive data sets forseveral functions. The main aim of a feature selection (FS) model is to eliminate noise, repetitive, and unnecessary featuresthat reduce the efficiency of classification. In a limited period, traditional FS models cannot manage massive datasets and filterunnecessary features. It has been discovered from the state-ofthe-art literature that metaheuristic algorithms perform better compared to other FS wrapper-based techniques. Common techniques such as the Genetic Algorithm (GA) andParticle Swarm Optimization (PSO) algorithm, however, suffer from slow convergence and local optima problems. Even with new generation algorithms such as Firefly heuristic and Fish Swarm Heuristic, these questions have been shown to overcome. This paper introduces an improved memetic optimization (EMO) algorithm for FS in this perspective by using conditional criteria in large datasets. The proposed EMO algorithm divides the entire dataset into sample blocksandconducts the task of learning in the map steps. The partial result obtained is combined into a final vector of feature weights in the reductionprocess which defines the appropriate collection of characteristics. Finally, the method of grouping based on the support vector machine (SVM) takes place. Within the Spark system, the proposed EMO algorithm is applied and the experimental results claim that it is superior to other approaches. The simulation results show that the maximum AUC values of 0.79 and 0.74 respectively are obtained by the EMO-FS model.
引用
收藏
页码:1547 / 1559
页数:13
相关论文
共 50 条
  • [1] Challenges of Feature Selection for Big Data Analytics
    Li, Jundong
    Liu, Huan
    [J]. IEEE INTELLIGENT SYSTEMS, 2017, 32 (02) : 9 - 15
  • [2] Feature Selection Techniques for Big Data Analytics
    Albattah, Waleed
    Khan, Rehan Ullah
    Alsharekh, Mohammed F.
    Khasawneh, Samer F.
    [J]. ELECTRONICS, 2022, 11 (19)
  • [3] Feature Selection and Classification of Big Data Using MapReduce Framework
    Devi, D. Renuka
    Sasikala, S.
    [J]. INTELLIGENT COMPUTING, INFORMATION AND CONTROL SYSTEMS, ICICCS 2019, 2020, 1039 : 666 - 673
  • [4] Evolutionary Feature Selection for Big Data Classification: A MapReduce Approach
    Peralta, Daniel
    del Rio, Sara
    Ramirez-Gallego, Sergio
    Triguero, Isaac
    Benitez, Josem.
    Herrera, Francisco
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [5] Improved Feature Selection Model for Big Data Analytics
    El-Hasnony, Ibrahim M.
    Barakat, Sherif I.
    Elhoseny, Mohamed
    Mostafa, Reham R.
    [J]. IEEE ACCESS, 2020, 8 : 66989 - 67004
  • [6] AMPO: Algorithm for MapReduce Performance Optimization for Enhancing Big Data Analytics
    Yambem, Nandita
    Nandakumar, A. N.
    [J]. 2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2017, : 717 - 723
  • [7] Automating Feature Extraction and Feature Selection in Big Data Security Analytics
    Sisiaridis, Dimitrios
    Markowitch, Olivier
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING (ICAISC 2018), PT II, 2018, 10842 : 423 - 432
  • [8] Reducing Data Complexity in Feature Extraction and Feature Selection for Big Data Security Analytics
    Sisiaridis, Dimitrios
    Markowitch, Olivier
    [J]. 2018 1ST INTERNATIONAL CONFERENCE ON DATA INTELLIGENCE AND SECURITY (ICDIS 2018), 2018, : 43 - 48
  • [9] A Novel Memetic Feature Selection Algorithm
    Montazeri, Mohadeseh
    Naji, Hamid Reza
    Montazeri, Mitra
    Faraahi, Ahmad
    [J]. 2013 5TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2013, : 295 - 300
  • [10] Big Data Analytics based on PANFIS MapReduce
    Za'in, Choiru
    Pratama, Mahardhika
    Lughofer, Edwin
    Ferdaus, Meftahul
    Cai, Qing
    Prasad, Mukesh
    [J]. INNS CONFERENCE ON BIG DATA AND DEEP LEARNING, 2018, 144 : 140 - 152