GA-based feature selection method for oversized data analysis in digital economy

被引:1
|
作者
Lv, Yao [1 ,4 ]
Liu, Peng [1 ]
Wang, Juan [1 ]
Zhang, Yao [1 ]
Slowik, Adam [2 ]
Lv, Jianhui [3 ]
机构
[1] Shenyang Univ, Sch Appl Technol, Shenyang, Peoples R China
[2] Koszalin Univ Technol, Dept Comp Sci & Engn, Koszalin, Poland
[3] Pengcheng Lab, Dept Networks, Shenzhen, Peoples R China
[4] Shenyang Univ, Sch Appl Technol, Shenyang 110044, Peoples R China
基金
中国国家自然科学基金;
关键词
accelerate economic development; feature selection; genetic algorithm; machine learning; oversized data analysis;
D O I
10.1111/exsy.13477
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the promotion and development of oversized data technology, many data analysis platforms based on super large data storage and computing frameworks have emerged in the industry. While the platforms with oversized economic data analysis combined with machine learning models are still relatively lacking. And oversized data also brings a new problem, that is the security of economic development. It is an important and difficult task to analyse and detect risks from oversized economic data. Based on machine learning, data analysis, economic market and other multidisciplinary fields, this paper proposes a machine learning method, which is a genetic algorithm (GA) based feature selection method: FSGA. This method abstracts every possible feature selection result into an individual in GA, generates a population through genetic operation, and measures the merits of the individual through fitness. In addition, this paper has conducted multitudinous simulation experiments on the GA-based FSGA method and the traditional LSTM data analysis method respectively. The accuracy rate and other indicators are obtained by comparing the training. The experimental results show that the GA-based FSGA machine learning method has higher prediction accuracy when analysing oversized economic data. And it is practical to accelerate the development of digital economy.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] A HYBRID METHOD FOR INTRUSION DETECTION WITH GA-BASED FEATURE SELECTION
    Chen, Zh-Xian
    Huang, Hao
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2011, 17 (02): : 175 - 186
  • [2] GA-Based Feature Selection Method for Imbalanced Data with Application in Radio Signal Recognition
    Limin Du
    Yang Xu
    Jun Liu
    Fangli Ma
    [J]. International Journal of Computational Intelligence Systems, 2015, 8 : 39 - 47
  • [3] GA-Based Feature Selection Method for Imbalanced Data with Application in Radio Signal Recognition
    Du, Limin
    Xu, Yang
    Liu, Jun
    Ma, Fangli
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2015, 8 : 39 - 47
  • [4] A GA-Based Wrapper Feature Selection for Animal Breeding Data Mining
    Unold, Olgierd
    Dobrowolski, Maciej
    Maciejewski, Henryk
    Skrobanek, Pawel
    Walkowicz, Ewa
    [J]. HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PT II, 2012, 7209 : 200 - 209
  • [5] A Novel GA-based Feature Selection Approach for High Dimensional Data
    De Stefano, Claudio
    Fontanella, Francesco
    di Freca, Alessandra Scotto
    [J]. PROCEEDINGS OF THE 2016 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'16 COMPANION), 2016, : 87 - 88
  • [6] A GA-based Feature Selection for High-dimensional Data Clustering
    Sun, Mei
    Xiong, Langhuan
    Sun, Haojun
    Jiang, Dazhi
    [J]. THIRD INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING, 2009, : 769 - 772
  • [7] Feature selection for modular GA-based classification
    Zhu, FM
    Guan, S
    [J]. APPLIED SOFT COMPUTING, 2004, 4 (04) : 381 - 393
  • [8] GA-based feature subset selection for myoelectric classification
    Oskoei, Mohammadreza Asghari
    Hu, Huosheng
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS, VOLS 1-3, 2006, : 1465 - +
  • [9] A GA-Based Approach to ICA Feature Selection: An Efficient Method to Classify Microarray Datasets
    Liu, Kun-Hong
    Zhang, Jun
    Li, Bo
    Du, Ji-Xiang
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 2, PROCEEDINGS, 2009, 5552 : 432 - +
  • [10] A GA-based feature selection algorithm for remote sensing images
    De Stefano, C.
    Fontanella, F.
    Marrocco, C.
    [J]. APPLICATIONS OF EVOLUTIONARY COMPUTING, PROCEEDINGS, 2008, 4974 : 285 - 294