RANDOM FOREST AND SUPPORT VECTOR MACHINE ON FEATURES SELECTION FOR REGRESSION ANALYSIS

被引:74
|
作者
Dewi, Christine [1 ]
Chen, Rung-Ching [1 ]
机构
[1] Chaoyang Univ Technol, Dept Informat Management, 168 Jifeng East Rd, Taichung 41349, Taiwan
关键词
Random forest; Features selection; SVM; Regression; VARIABLE IMPORTANCE; CLASSIFICATION; TREES;
D O I
10.24507/ijicic.15.06.2027
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection becomes predominant and quite prominent in the case of datasets that are contained with a higher number of variables. RF (Random Forest) has emerged as a robust algorithm that can handle a feature selection problem with a higher number of variables. It is also very much efficient while dealing with regression problems. In this work, we proposed the combination of RF, SVM (Support Vector Machine) and tune SVM regression to improve the model performance. We use four outstanding regression datasets from the UCI (University of California Irvine) machine learning repository. In addition, the ranking of important features by RF for affection factors is given out. We prove that it is essential to select the best features to improve the performance of the model. The experimental results show that our proposed model has a better effect compared to other methods in each dataset. The trend of RMSE (Root Mean Squared Error) value is decreased, and the r-value is increased in every experiment for all datasets. Furthermore, it is indicated that the regression predictions perfectly fit the data.
引用
收藏
页码:2027 / 2037
页数:11
相关论文
共 50 条
  • [1] Machine Learning Scoring Functions Based on Random Forest and Support Vector Regression
    Ballester, Pedro J.
    PATTERN RECOGNITION IN BIOINFORMATICS, 2012, 7632 : 14 - 25
  • [2] Comparison of random forest and support vector machine regression models for forecasting road accidents
    Gatera, Antoine
    Kuradusenge, Martin
    Bajpai, Gaurav
    Mikeka, Chomora
    Shrivastava, Sarika
    SCIENTIFIC AFRICAN, 2023, 21
  • [3] Seepage and dam deformation analyses with statistical models: support vector regression machine and random forest
    Belmokre, Ahmed
    Mihoubi, Mustapha Kamel
    Santillan, David
    3RD INTERNATIONAL CONFERENCE ON STRUCTURAL INTEGRITY (ICSI 2019), 2019, 17 : 698 - 703
  • [4] Comparative analysis of Random Forest and Support Vector Machine for benthic habitat segmentation
    Narciso, Gilson A. M.
    Tamondong, Ayin M.
    Blanco, Ariel C.
    Nakamura, Takashi
    Nadaoka, Kazuo
    EIGHTH GEOINFORMATION SCIENCE SYMPOSIUM 2023: GEOINFORMATION SCIENCE FOR SUSTAINABLE PLANET, 2024, 12977
  • [5] Random Forest and Support Vector Machine based Hybrid Approach to Sentiment Analysis
    Al Amrani, Yassine
    Lazaar, Mohamed
    El Kadiri, Kamal Eddine
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2017), 2018, 127 : 511 - 520
  • [6] Stochastic Support Vector Machine for Classifying and Regression of Random Variables
    Abaszade, Maryam
    Effati, Sohrab
    NEURAL PROCESSING LETTERS, 2018, 48 (01) : 1 - 29
  • [7] Stochastic Support Vector Machine for Classifying and Regression of Random Variables
    Maryam Abaszade
    Sohrab Effati
    Neural Processing Letters, 2018, 48 : 1 - 29
  • [8] Interval Support Vector Machine in Regression Analysis
    Arjmandzadeh, Ameneh
    Effati, Sohrab
    Zamirian, Mohammad
    JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE-JMCS, 2011, 2 (03): : 565 - 571
  • [9] Possibilistic Regression Analysis by Support Vector Machine
    Hao, Pei-Yi
    IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 889 - 894
  • [10] Comparison of Random Forest and Support Vector Machine Regression for Prediction of BIM Labor Cost on Architectural Modeling and Plotting
    Huang C.-H.
    Hsieh S.-H.
    Journal of the Chinese Institute of Civil and Hydraulic Engineering, 2021, 33 (05): : 389 - 398