Estimating the Performance of Random Forest versus Multiple Regression for Predicting Prices of the Apartments

被引:0
|
作者
Ceh, Marjan [1 ]
Kilibarda, Milan [2 ]
Lisec, Anka [1 ]
Bajat, Branislav [2 ]
机构
[1] Univ Ljubljana, Fac Civil & Geodet Engn, Ljubljana 1000, Slovenia
[2] Univ Belgrade, Fac Civil Engn, Bulevar Kralja Aleksandra 73, Belgrade 11000, Serbia
关键词
random forest; OLS; hedonic price model; PCA; Ljubljana; MODEL;
D O I
10.3390/ijgi7050168
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The goal of this study is to analyse the predictive performance of the random forest machine learning technique in comparison to commonly used hedonic models based on multiple regression for the prediction of apartment prices. A data set that includes 7407 records of apartment transactions referring to real estate sales from 2008-2013 in the city of Ljubljana, the capital of Slovenia, was used in order to test and compare the predictive performances of both models. Apparent challenges faced during modelling included (1) the non-linear nature of the prediction assignment task; (2) input data being based on transactions occurring over a period of great price changes in Ljubljana whereby a 28% decline was noted in six consecutive testing years; and (3) the complex urban form of the case study area. Available explanatory variables, organised as a Geographic Information Systems (GIS) ready dataset, including the structural and age characteristics of the apartments as well as environmental and neighbourhood information were considered in the modelling procedure. All performance measures (R-2 values, sales ratios, mean average percentage error (MAPE), coefficient of dispersion (COD)) revealed significantly better results for predictions obtained by the random forest method, which confirms the prospective of this machine learning technique on apartment price prediction.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Estimating and Interpreting Fine-Scale Gridded Population Using Random Forest Regression and Multisource Data
    Zhou, Yun
    Ma, Mingguo
    Shi, Kaifang
    Peng, Zhenyu
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2020, 9 (06)
  • [42] Random Forest Regression Analysis for Estimating Dielectric Properties in Epoxy Composites Doped with Hybrid Nano Fillers
    Shingala, Bansi
    Panchal, Piyushkumar
    Thakor, Sanketsinh
    Jain, Prince
    Joshi, Anand
    Vaja, Chandan R.
    Siddharth, R. K.
    Rana, V. A.
    JOURNAL OF MACROMOLECULAR SCIENCE PART B-PHYSICS, 2024, 63 (12): : 1297 - 1311
  • [43] A comparative analysis of linear regression, neural networks and random forest regression for predicting air ozone employing soft sensor models
    Zhou, Zheng
    Qiu, Cheng
    Zhang, Yufan
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [44] A comparative analysis of linear regression, neural networks and random forest regression for predicting air ozone employing soft sensor models
    Zheng Zhou
    Cheng Qiu
    Yufan Zhang
    Scientific Reports, 13
  • [45] Predicting a DNA-binding protein using random forest with multiple mathematical features
    Guan, Changge
    Niu, Xiaohui
    Shi, Feng
    Yang, Kun
    Li, Nana
    BIO-MEDICAL MATERIALS AND ENGINEERING, 2015, 26 : S1883 - S1889
  • [46] Large-scale multivariate forecasting models for Dengue - LSTM versus random forest regression
    Mussumeci, Elisa
    Coelho, Flavio Codeco
    SPATIAL AND SPATIO-TEMPORAL EPIDEMIOLOGY, 2020, 35
  • [47] Foreign Versus Local Ownership and Performance in Eastern Versus Western EU: A Random Forest Application
    Horobet, Alexandra
    Popovici, Oana Cristina
    Bulai, Vlad
    Belascu, Lucian
    Rosca, Eugen
    INZINERINE EKONOMIKA-ENGINEERING ECONOMICS, 2023, 34 (02): : 123 - 138
  • [48] Performance Evaluation of Random Forest Regression Model in Tracking Parkinson's Disease Progress
    Peterek, Tomas
    Dohnalek, Pavel
    Gajdos, Petr
    Smondrk, Maros
    2013 13TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS), 2013, : 83 - 87
  • [49] Comparing Random Forest with Logistic Regression for Predicting Class-Imbalanced Civil War Onset Data
    Muchlinski, David
    Siroky, David
    He, Jingrui
    Kocher, Matthew
    POLITICAL ANALYSIS, 2016, 24 (01) : 87 - 103
  • [50] The Comparative Performance of Logistic Regression and Random Forest in Propensity Score Methods: a Simulation Study
    Ali, M. Sanni
    Khalid, Sara
    Collins, Gary S.
    Prieto-Alhambra, Daniel
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2017, 26 : 489 - 489