New Algorithms of Feature Selection and Big Data Assignment for CBR System Integrated by Bayesian Network

被引:2
|
作者
Guo, Yuan [1 ,2 ]
Sun, Yu [1 ]
Wu, Kai [1 ]
Jiang, Kerong [2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Mech Engn, 200 Xiaolingwei St, Nanjing 210094, Peoples R China
[2] Hefei Univ, Sch Adv Mfg Engn, 99 Jinxiu Rd, Hefei 230601, Peoples R China
关键词
Feature selection; integrated system; CBR; big data; PARALLEL ALGORITHM;
D O I
10.1145/3373086
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Under big data, the integrated system of case-based reasoning and Bayesian network has exhibited great advantage in implementing the intelligence of engineering application in many domains. To further improve the performance of the hybrid system, this article proposes Probability Change Measurement of Solution Parameters (PCMSP)-Half-Division-Cross (HDC) method, which includes two algorithms, namely PCMSP and HDC algorithm. PCMSP algorithm can select principal problem features according to their effects upon all solution features measured by calculating the weighted relative probability (RP) change of all solution features caused by each problem feature. PCMSP algorithm can perfectly work under big data no matter how complex the data types are and how huge the data size is. HDC algorithm is used to assign the computation task of big data to enhance the efficiency of the integrated system. HDC algorithm assigns big data by grouping all the problem parameters into many small sub-groups and then distributing the data which covers the same sub-group of problem parameters to a slave node. HDC algorithm can guarantee enough efficiency of the integrated system under big data no matter how large the number of problem parameters is. Finally, lots of experiments are executed to validate the proposed method.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Automating Feature Extraction and Feature Selection in Big Data Security Analytics
    Sisiaridis, Dimitrios
    Markowitch, Olivier
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING (ICAISC 2018), PT II, 2018, 10842 : 423 - 432
  • [32] Study on Feature Selection and Feature Deep Learning Model For Big Data
    Yu, Ping
    Yan, Hui
    [J]. 2018 3RD INTERNATIONAL CONFERENCE ON SMART CITY AND SYSTEMS ENGINEERING (ICSCSE), 2018, : 792 - 795
  • [33] Towards ultrahigh dimensional feature selection for big data
    Tan, Mingkui
    Tsang, Ivor W.
    Wang, Li
    [J]. Journal of Machine Learning Research, 2014, 15 : 1371 - 1429
  • [34] Feature Selection Using Genetic Algorithm for Big Data
    Saidi, Rania
    Ncir, Waad Bouaguel
    Essoussi, Nadia
    [J]. INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 352 - 361
  • [35] Improved Feature Selection Model for Big Data Analytics
    El-Hasnony, Ibrahim M.
    Barakat, Sherif I.
    Elhoseny, Mohamed
    Mostafa, Reham R.
    [J]. IEEE ACCESS, 2020, 8 : 66989 - 67004
  • [36] Scalable and Accurate Online Feature Selection for Big Data
    Yu, Kui
    Wu, Xindong
    Ding, Wei
    Pei, Jian
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2016, 11 (02)
  • [37] Reducing Data Complexity in Feature Extraction and Feature Selection for Big Data Security Analytics
    Sisiaridis, Dimitrios
    Markowitch, Olivier
    [J]. 2018 1ST INTERNATIONAL CONFERENCE ON DATA INTELLIGENCE AND SECURITY (ICDIS 2018), 2018, : 43 - 48
  • [38] Towards Ultrahigh Dimensional Feature Selection for Big Data
    Tan, Mingkui
    Tsang, Ivor W.
    Wang, Li
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 1371 - 1429
  • [39] An online approach for feature selection for classification in big data
    Nazar, Nasrin Banu
    Senthilkumar, Radha
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2017, 25 (01) : 163 - 171
  • [40] Distributed Evolutionary Feature Selection for Big Data Processing
    Bouaguel, Waad
    Ben NCir, Chiheb Eddine
    [J]. VIETNAM JOURNAL OF COMPUTER SCIENCE, 2022, 09 (03) : 313 - 332