Estimating the Performance of Random Forest versus Multiple Regression for Predicting Prices of the Apartments

被引:0
|
作者
Ceh, Marjan [1 ]
Kilibarda, Milan [2 ]
Lisec, Anka [1 ]
Bajat, Branislav [2 ]
机构
[1] Univ Ljubljana, Fac Civil & Geodet Engn, Ljubljana 1000, Slovenia
[2] Univ Belgrade, Fac Civil Engn, Bulevar Kralja Aleksandra 73, Belgrade 11000, Serbia
关键词
random forest; OLS; hedonic price model; PCA; Ljubljana; MODEL;
D O I
10.3390/ijgi7050168
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The goal of this study is to analyse the predictive performance of the random forest machine learning technique in comparison to commonly used hedonic models based on multiple regression for the prediction of apartment prices. A data set that includes 7407 records of apartment transactions referring to real estate sales from 2008-2013 in the city of Ljubljana, the capital of Slovenia, was used in order to test and compare the predictive performances of both models. Apparent challenges faced during modelling included (1) the non-linear nature of the prediction assignment task; (2) input data being based on transactions occurring over a period of great price changes in Ljubljana whereby a 28% decline was noted in six consecutive testing years; and (3) the complex urban form of the case study area. Available explanatory variables, organised as a Geographic Information Systems (GIS) ready dataset, including the structural and age characteristics of the apartments as well as environmental and neighbourhood information were considered in the modelling procedure. All performance measures (R-2 values, sales ratios, mean average percentage error (MAPE), coefficient of dispersion (COD)) revealed significantly better results for predictions obtained by the random forest method, which confirms the prospective of this machine learning technique on apartment price prediction.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Application of random forest regression and comparison of its performance to multiple linear regression in modeling groundwater nitrate concentration at the African continent scale
    Ouedraogo, Issoufou
    Defourny, Pierre
    Vanclooster, Marnik
    HYDROGEOLOGY JOURNAL, 2019, 27 (03) : 1081 - 1098
  • [22] The Performance Comparison of Multiple Linear Regression, Random Forest and Artificial Neural Network by using Photovoltaic and Atmospheric Data
    Kayri, Murat
    Kayri, Ismail
    Gencoglu, Muhsin Tunay
    2017 14TH INTERNATIONAL CONFERENCE ON ENGINEERING OF MODERN ELECTRIC SYSTEMS (EMES), 2017, : 1 - 4
  • [23] Optimized Ensemble Support Vector Regression Models for Predicting Stock Prices with Multiple Kernels
    Thumu, Subba Reddy
    Nellore, Geethanjali
    ACTA INFORMATICA PRAGENSIA, 2024, 13 (01) : 24 - 37
  • [24] Random forest versus logistic regression: a large-scale benchmark experiment
    Couronne, Raphael
    Probst, Philipp
    Boulesteix, Anne-Laure
    BMC BIOINFORMATICS, 2018, 19
  • [25] Random forest versus logistic regression: a large-scale benchmark experiment
    Raphael Couronné
    Philipp Probst
    Anne-Laure Boulesteix
    BMC Bioinformatics, 19
  • [26] A Random Forest Regression Model for Predicting the Movement of Horseshoe Crabs in Long Island Sound
    Senbel, Samah
    Kasinak, Jo-Marie Elisha
    Mattei, Jennifer
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT IV, 2021, 12952 : 107 - 119
  • [27] Predicting Ozone Layer Concentration Using Multivariate Adaptive Regression Splines, Random Forest and Classification and Regression Tree
    Roy, Sanjiban Sekhar
    Pratyush, Chitransh
    Barna, Cornel
    SOFT COMPUTING APPLICATIONS, SOFA 2016, VOL 2, 2018, 634 : 140 - 152
  • [28] Performance of Conditional Random Forest and Regression Models at Predicting Human Fecal Contamination of Produce Irrigation Ponds in the Southeastern United States
    Hofstetter, Jessica
    Holcomb, David A.
    Kahler, Amy M.
    Rodrigues, Camila
    da Silva, Andre Luiz Biscaia Ribeiro
    Mattioli, Mia C.
    ACS ES&T WATER, 2024, 4 (12): : 5844 - 5855
  • [29] Predicting Students Academic Performance using an Improved Random Forest Classifier
    Jayaprakash, Sujith
    Krishnan, Sangeetha
    Jaiganesh, V
    2020 INTERNATIONAL CONFERENCE ON EMERGING SMART COMPUTING AND INFORMATICS (ESCI), 2020, : 238 - 243
  • [30] Estimating persistent oil contamination in tropical region using vegetation indices and random forest regression
    Lassalle, Guillaume
    Credoz, Anthony
    Hedacq, Remy
    Bertoni, Georges
    Dubucq, Dominique
    Fabre, Sophie
    Elger, Arnaud
    ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY, 2019, 184