New Algorithms of Feature Selection and Big Data Assignment for CBR System Integrated by Bayesian Network

被引:2
|
作者
Guo, Yuan [1 ,2 ]
Sun, Yu [1 ]
Wu, Kai [1 ]
Jiang, Kerong [2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Mech Engn, 200 Xiaolingwei St, Nanjing 210094, Peoples R China
[2] Hefei Univ, Sch Adv Mfg Engn, 99 Jinxiu Rd, Hefei 230601, Peoples R China
关键词
Feature selection; integrated system; CBR; big data; PARALLEL ALGORITHM;
D O I
10.1145/3373086
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Under big data, the integrated system of case-based reasoning and Bayesian network has exhibited great advantage in implementing the intelligence of engineering application in many domains. To further improve the performance of the hybrid system, this article proposes Probability Change Measurement of Solution Parameters (PCMSP)-Half-Division-Cross (HDC) method, which includes two algorithms, namely PCMSP and HDC algorithm. PCMSP algorithm can select principal problem features according to their effects upon all solution features measured by calculating the weighted relative probability (RP) change of all solution features caused by each problem feature. PCMSP algorithm can perfectly work under big data no matter how complex the data types are and how huge the data size is. HDC algorithm is used to assign the computation task of big data to enhance the efficiency of the integrated system. HDC algorithm assigns big data by grouping all the problem parameters into many small sub-groups and then distributing the data which covers the same sub-group of problem parameters to a slave node. HDC algorithm can guarantee enough efficiency of the integrated system under big data no matter how large the number of problem parameters is. Finally, lots of experiments are executed to validate the proposed method.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Machine learning based feature selection and knowledge reasoning for CBR system under big data
    Guo, Yuan
    Zhang, Bing
    Sun, Y.
    Jiang, K.
    Wu, K.
    [J]. Pattern Recognition, 2021, 112
  • [2] Machine learning based feature selection and knowledge reasoning for CBR system under big data
    Guo, Yuan
    Zhang, Bing
    Sun, Y.
    Jiang, K.
    Wu, K.
    [J]. PATTERN RECOGNITION, 2021, 112
  • [3] Streaming feature selection algorithms for big data: A survey
    AlNuaimi, Noura
    Masud, Mohammad Mehedy
    Serhani, Mohamed Adel
    Zaki, Nazar
    [J]. APPLIED COMPUTING AND INFORMATICS, 2022, 18 (1/2) : 113 - 135
  • [4] Feature Selection for Naive Bayesian Network Ensemble using Evolutionary Algorithms
    Zagorecki, Adam
    [J]. FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2014, 2014, 2 : 381 - 385
  • [5] A Bayesian network approach to feature selection in mass spectrometry data
    Karl W Kuschner
    Dariya I Malyarenko
    William E Cooke
    Lisa H Cazares
    OJ Semmes
    Eugene R Tracy
    [J]. BMC Bioinformatics, 11
  • [6] A Bayesian network approach to feature selection in mass spectrometry data
    Kuschner, Karl W.
    Malyarenko, Dariya I.
    Cooke, William E.
    Cazares, Lisa H.
    Semmes, O. J.
    Tracy, Eugene R.
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [7] Data visualization and feature selection: New algorithms for nongaussian data
    Yang, HH
    Moody, J
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 687 - 693
  • [8] An Evaluation of Feature Selection and Reduction Algorithms for Network IDS Data
    Bjerkestrand, Therese
    Tsaptsinos, Dimitris
    Pfluegel, Eckhard
    [J]. 2015 International Conference on Cyber Situational Awareness, Data Analytics and Assessment (CyberSA), 2015,
  • [9] A New Big Data Feature Selection Approach for Text Classification
    Amazal, Houda
    Kissi, Mohamed
    [J]. SCIENTIFIC PROGRAMMING, 2021, 2021
  • [10] BAYESIAN FEATURE SELECTION WITH DATA INTEGRATION
    Pour, Ali Foroughi
    Dalton, Lori A.
    [J]. 2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 504 - 508