RANDOM FOREST AND SUPPORT VECTOR MACHINE ON FEATURES SELECTION FOR REGRESSION ANALYSIS

被引:74
|
作者
Dewi, Christine [1 ]
Chen, Rung-Ching [1 ]
机构
[1] Chaoyang Univ Technol, Dept Informat Management, 168 Jifeng East Rd, Taichung 41349, Taiwan
关键词
Random forest; Features selection; SVM; Regression; VARIABLE IMPORTANCE; CLASSIFICATION; TREES;
D O I
10.24507/ijicic.15.06.2027
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection becomes predominant and quite prominent in the case of datasets that are contained with a higher number of variables. RF (Random Forest) has emerged as a robust algorithm that can handle a feature selection problem with a higher number of variables. It is also very much efficient while dealing with regression problems. In this work, we proposed the combination of RF, SVM (Support Vector Machine) and tune SVM regression to improve the model performance. We use four outstanding regression datasets from the UCI (University of California Irvine) machine learning repository. In addition, the ranking of important features by RF for affection factors is given out. We prove that it is essential to select the best features to improve the performance of the model. The experimental results show that our proposed model has a better effect compared to other methods in each dataset. The trend of RMSE (Root Mean Squared Error) value is decreased, and the r-value is increased in every experiment for all datasets. Furthermore, it is indicated that the regression predictions perfectly fit the data.
引用
收藏
页码:2027 / 2037
页数:11
相关论文
共 50 条
  • [21] Random Forest Ensemble of Support Vector Regression Models for Solar Power Forecasting
    Abuella, Mohamed
    Chowdhury, Badrul
    2017 IEEE POWER & ENERGY SOCIETY INNOVATIVE SMART GRID TECHNOLOGIES CONFERENCE (ISGT), 2017,
  • [22] Statistical analysis and prediction of force and overtopping rates on large-scale vertical walls using support vector machine and random forest regression
    Croquer, Sergio
    Lacey, Jay
    Nistor, Ioan
    CANADIAN JOURNAL OF CIVIL ENGINEERING, 2023, 50 (05) : 375 - 386
  • [23] Variable selection using support vector regression and random forests: A comparative study
    Ben Ishak, Anis
    INTELLIGENT DATA ANALYSIS, 2016, 20 (01) : 83 - 104
  • [24] Selection Features and Support Vector Machine for Credit Card Risk Identification
    Rtayli, Naoufal
    Enneya, Nourddine
    13TH INTERNATIONAL CONFERENCE INTERDISCIPLINARITY IN ENGINEERING (INTER-ENG 2019), 2020, 46 : 941 - 948
  • [25] Regression depth and support vector machine
    Christmann, Andreas
    DATA DEPTH: ROBUST MULTIVARIATE ANALYSIS, COMPUTATIONAL GEOMETRY AND APPLICATIONS, 2006, 72 : 71 - 85
  • [26] An ε-twin support vector machine for regression
    Yuan-Hai Shao
    Chun-Hua Zhang
    Zhi-Min Yang
    Ling Jing
    Nai-Yang Deng
    Neural Computing and Applications, 2013, 23 : 175 - 185
  • [27] Comparison between random forest and support vector machine algorithms for LULC classification
    Avci, Cengiz
    Budak, Muhammed
    Yagmur, Nur
    Balcik, Filiz Bektas
    INTERNATIONAL JOURNAL OF ENGINEERING AND GEOSCIENCES, 2023, 8 (01): : 1 - 10
  • [28] Cancer Disease Prediction with Support Vector Machine and Random Forest Classification Techniques
    Ahmed, Ashfaq K.
    Aljahdali, Sultan
    Hundewale, Nisar
    Ahmed, Ishthaq K.
    2012 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND CYBERNETICS (CYBERNETICSCOM), 2012, : 16 - 19
  • [29] An ε-twin support vector machine for regression
    Shao, Yuan-Hai
    Zhang, Chun-Hua
    Yang, Zhi-Min
    Jing, Ling
    Deng, Nai-Yang
    NEURAL COMPUTING & APPLICATIONS, 2013, 23 (01): : 175 - 185
  • [30] Prediction of impermeability of the concrete structure based on random forest and support vector machine
    Wang, Lei
    Wu, Xianguo
    Chen, Hongyu
    Zeng, Tiemei
    2020 INTERNATIONAL CONFERENCE ON SUSTAINABLE DEVELOPMENT AND ENVIRONMENTAL SCIENCE, 2020, 552