A mass appraisal assessment study using machine learning based on multiple regression and random forest

被引:60
|
作者
Yilmazer, Seckin [1 ,2 ]
Kocaman, Sultan [1 ]
机构
[1] Hacettepe Univ, Dept Geomat Engn, TR-06800 Ankara, Turkey
[2] Gen Directorate Land Registry & Cadastre, Ankara, Turkey
关键词
Real estate valuation; Mass appraisal; Multiple regression analysis; Random Forest; PRICE; VALUATION; MODEL;
D O I
10.1016/j.landusepol.2020.104889
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Mass appraisal is a complex matter because it depends on several categorical and continuously changing or constant parameters. In addition, development of new assessment approaches for mass appraisal of real estate properties in highly complex urban environments is desirable. The advancements in geospatial technologies and machine learning algorithms open up new horizons. For this reason, the purpose of the present study is to compare one conventional stepwise linear multiple regression (MRA) and one more automated machine learning approach, random forest (RF), for mass appraisal in an urban residential area where commercial properties are also available. A part of Mamak District, Ankara, Turkey is selected as the study area since the property values are diverse and representative. Additionally, the district has a complex and developing urban structure. The data employed in the study were compiled under a cadastral modernization project of General Directorate of the Land Registry and Cadastre of Turkey (GDLRC) and were based on the reports of licensed experts (similar to 50 %), court reports (similar to 20 %), field surveys, or a combined analysis of all. Consequently, the data used in the study has a high level of confidence. The initial set of parameters used in both methods reflect the most frequently observed characteristics of the real estate properties in the study area that are also effective on the values. The stepwise MRA required manual adjustments of the final parameter set by the expert, whereas RF eliminated unusable parameters fully automatically. The method performance was assessed by using a subset of the training data as a random test. According to the accuracy assessment results, the RF (Adjusted R-2 0.734; the total variance explained from the model) slightly outperforms the MRA (Adjusted R-2 0.696) where the optimal parameters were set by the human expert. Finally, the results exhibited are promising for quick assessment of mass appraisal and a comprehensive discussion is presented in the study.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Machine Learning Predictive Models for Pile Drivability: An Evaluation of Random Forest Regression and Multivariate Adaptive Regression Splines
    Zhang, Wengang
    Wu, Chongzhi
    INFORMATION TECHNOLOGY IN GEO-ENGINEERING, 2020, : 243 - 255
  • [22] Application Research on Risk Assessment of Municipal Pipeline Network Based on Random Forest Machine Learning Algorithm
    Cen, Hang
    Huang, Delong
    Liu, Qiang
    Zong, Zhongling
    Tang, Aiping
    WATER, 2023, 15 (10)
  • [23] Multifidelity aerodynamic flow field prediction using random forest-based machine learning
    Nagawkar, Jethro
    Leifsson, Leifur
    AEROSPACE SCIENCE AND TECHNOLOGY, 2022, 123
  • [24] Leptospirosis modelling using hydrometeorological indices and random forest machine learning
    Jayaramu, Veianthan
    Zulkafli, Zed
    De Stercke, Simon
    Buytaert, Wouter
    Rahmat, Fariq
    Rahman, Ribhan Zafira Abdul
    Ishak, Asnor Juraiza
    Tahir, Wardah
    Ab Rahman, Jamalludin
    Fuzi, Nik Mohd Hafiz Mohd
    INTERNATIONAL JOURNAL OF BIOMETEOROLOGY, 2023, 67 (03) : 423 - 437
  • [25] Classification of Phishing Email Using Random Forest Machine Learning Technique
    Akinyelu, Andronicus A.
    Adewumi, Aderemi O.
    JOURNAL OF APPLIED MATHEMATICS, 2014,
  • [26] House Price Prediction using Random Forest Machine Learning Technique
    Adetunji, Abigail Bola
    Akande, Oluwatobi Noah
    Ajala, Funmilola Alaba
    Oyewo, Ololade
    Akande, Yetunde Faith
    Oluwadara, Gbenle
    8TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2020 & 2021): DEVELOPING GLOBAL DIGITAL ECONOMY AFTER COVID-19, 2022, 199 : 806 - 813
  • [27] Leptospirosis modelling using hydrometeorological indices and random forest machine learning
    Veianthan Jayaramu
    Zed Zulkafli
    Simon De Stercke
    Wouter Buytaert
    Fariq Rahmat
    Ribhan Zafira Abdul Rahman
    Asnor Juraiza Ishak
    Wardah Tahir
    Jamalludin Ab Rahman
    Nik Mohd Hafiz Mohd Fuzi
    International Journal of Biometeorology, 2023, 67 : 423 - 437
  • [28] A comparison of random forest regression and multiple linear regression for prediction in neuroscience
    Smith, Paul F.
    Ganesh, Siva
    Liu, Ping
    JOURNAL OF NEUROSCIENCE METHODS, 2013, 220 (01) : 85 - 91
  • [29] The linear random forest algorithm and its advantages in machine learning assisted logging regression modeling
    Ao, Yile
    Li, Hongqi
    Zhu, Liping
    Ali, Sikandar
    Yang, Zhongguo
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2019, 174 : 776 - 789
  • [30] A Complex Terrain Simulation Approach Using Ensemble Learning of Random Forest Regression
    Huang, Zechun
    Liu, Zipu
    JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2022, 50 (10) : 2011 - 2023