Estimating the Performance of Random Forest versus Multiple Regression for Predicting Prices of the Apartments

Cited by: 0
Authors
Ceh, Marjan [1]
Kilibarda, Milan [2]
Lisec, Anka [1]
Bajat, Branislav [2]
Affiliations
[1] Univ Ljubljana, Fac Civil & Geodet Engn, Ljubljana 1000, Slovenia
[2] Univ Belgrade, Fac Civil Engn, Bulevar Kralja Aleksandra 73, Belgrade 11000, Serbia
Keywords
random forest; OLS; hedonic price model; PCA; Ljubljana; MODEL
DOI
10.3390/ijgi7050168
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The goal of this study is to analyse the predictive performance of the random forest machine learning technique in comparison with commonly used hedonic models based on multiple regression for the prediction of apartment prices. A data set of 7407 apartment transaction records from real estate sales in 2008-2013 in the city of Ljubljana, the capital of Slovenia, was used to test and compare the predictive performance of both models. The challenges faced during modelling included (1) the non-linear nature of the prediction task; (2) input data based on transactions that occurred over a period of large price changes in Ljubljana, during which a 28% decline was noted over six consecutive testing years; and (3) the complex urban form of the case study area. The available explanatory variables, organised as a Geographic Information Systems (GIS) ready dataset and including the structural and age characteristics of the apartments as well as environmental and neighbourhood information, were considered in the modelling procedure. All performance measures (R² values, sales ratios, mean absolute percentage error (MAPE), coefficient of dispersion (COD)) revealed significantly better results for predictions obtained by the random forest method, which confirms the potential of this machine learning technique for apartment price prediction.
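The performance measures named in the abstract can be illustrated with a minimal sketch. It assumes the common mass-appraisal conventions: the sales ratio is the predicted price divided by the sale price, and COD is the mean absolute deviation of the ratios from their median, expressed as a percentage of that median. The abstract does not spell out the formulas, so the paper's exact definitions may differ. The function names and the sample prices are hypothetical.

```python
from statistics import mean, median


def sales_ratios(predicted, actual):
    """Ratio of each predicted price to the corresponding sale price."""
    return [p / a for p, a in zip(predicted, actual)]


def mape(predicted, actual):
    """Mean absolute percentage error, in percent."""
    return 100.0 * mean(abs(p - a) / a for p, a in zip(predicted, actual))


def cod(predicted, actual):
    """Coefficient of dispersion: mean absolute deviation of the sales
    ratios from their median, as a percentage of the median (assumed
    definition, not taken from the paper)."""
    ratios = sales_ratios(predicted, actual)
    med = median(ratios)
    return 100.0 * mean(abs(r - med) for r in ratios) / med


def r_squared(predicted, actual):
    """Coefficient of determination of the predictions."""
    avg = mean(actual)
    ss_res = sum((a - p) ** 2 for p, a in zip(predicted, actual))
    ss_tot = sum((a - avg) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    # Hypothetical sale prices and model predictions, for illustration only.
    actual = [100.0, 200.0, 400.0]
    predicted = [110.0, 180.0, 400.0]
    print("median sales ratio:", median(sales_ratios(predicted, actual)))
    print("MAPE (%):", round(mape(predicted, actual), 2))
    print("COD (%):", round(cod(predicted, actual), 2))
    print("R^2:", round(r_squared(predicted, actual), 4))
```

Under these definitions, lower MAPE and COD, a median sales ratio closer to 1, and a higher R² all indicate better predictions, which is the sense in which the abstract reports the random forest outperforming multiple regression.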
Pages: 16
Related papers
50 in total
  • [1] Estimating residual variance in random forest regression
    Mendez, Guillermo
    Lohr, Sharon
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (11) : 2937 - 2950
  • [2] Variable Importance Assessment in Regression: Linear Regression versus Random Forest
    Groemping, Ulrike
    AMERICAN STATISTICIAN, 2009, 63 (04): : 308 - 319
  • [3] A comparison of random forest regression and multiple linear regression for prediction in neuroscience
    Smith, Paul F.
    Ganesh, Siva
    Liu, Ping
    JOURNAL OF NEUROSCIENCE METHODS, 2013, 220 (01) : 85 - 91
  • [4] Predicting Popularity of Online Articles using Random Forest Regression
    Shreyas, R.
    Akshata, D. M.
    Mahanand, B. S.
    Shagun, B.
    Abhishek, C. M.
    2016 SECOND INTERNATIONAL CONFERENCE ON COGNITIVE COMPUTING AND INFORMATION PROCESSING (CCIP), 2016,
  • [5] Random Forest Regression in Predicting Students' Achievements and Fuzzy Grades
    Doz, Daniel
    Cotic, Mara
    Felda, Darjo
    MATHEMATICS, 2023, 11 (19)
  • [6] Performance Comparison of Support Vector Regression, Random Forest and Multiple Linear Regression to Forecast the Power of Photovoltaic Panels
    Chahboun, Souhaila
    Maaroufi, Mohamed
    PROCEEDINGS OF 2021 9TH INTERNATIONAL RENEWABLE AND SUSTAINABLE ENERGY CONFERENCE (IRSEC), 2021, : 95 - 98
  • [7] Evaluation of random forest regression and multiple linear regression for predicting indoor fine particulate matter concentrations in a highly polluted city
    Yuchi, Weiran
    Gombojav, Enkhjargal
    Boldbaatar, Buyantushig
    Galsuren, Jargalsaikhan
    Enkhmaa, Sarangerel
    Beejin, Bolor
    Naidan, Gerel
    Ochir, Chimedsuren
    Legtseg, Bayarkhuu
    Byambaa, Tsogtbaatar
    Barn, Prabjit
    Henderson, Sarah B.
    Janes, Craig R.
    Lanphear, Bruce P.
    McCandless, Lawrence C.
    Takaro, Tim K.
    Venners, Scott A.
    Webster, Glenys M.
    Allen, Ryan W.
    ENVIRONMENTAL POLLUTION, 2019, 245 : 746 - 753
  • [8] Regression and Random Forest Machine Learning Have Limited Performance in Predicting Bowel Preparation in Veteran Population
    Jacob E. Kurlander
    Akbar K. Waljee
    Stacy B. Menees
    Rachel Lipson
    Alex N. Kokaly
    Andrew J. Read
    Karmel S. Shehadeh
    Amy Cohn
    Sameer D. Saini
    Digestive Diseases and Sciences, 2022, 67 : 2827 - 2841
  • [9] Regression and Random Forest Machine Learning Have Limited Performance in Predicting Bowel Preparation in Veteran Population
    Kurlander, Jacob E.
    Waljee, Akbar K.
    Menees, Stacy B.
    Lipson, Rachel
    Kokaly, Alex N.
    Read, Andrew J.
    Shehadeh, Karmel S.
    Cohn, Amy
    Saini, Sameer D.
    DIGESTIVE DISEASES AND SCIENCES, 2022, 67 (07) : 2827 - 2841
  • [10] A Comparison of Logistic Regression, Random Forest Models in Predicting the Risk of Diabetes
    Zhang, Baoxin
    Lu, Li
    Hou, Jiaqi
    THIRD INTERNATIONAL SYMPOSIUM ON IMAGE COMPUTING AND DIGITAL MEDICINE (ISICDM 2019), 2019, : 231 - 234