A mass appraisal assessment study using machine learning based on multiple regression and random forest

被引:60
|
作者
Yilmazer, Seckin [1 ,2 ]
Kocaman, Sultan [1 ]
机构
[1] Hacettepe Univ, Dept Geomat Engn, TR-06800 Ankara, Turkey
[2] Gen Directorate Land Registry & Cadastre, Ankara, Turkey
关键词
Real estate valuation; Mass appraisal; Multiple regression analysis; Random Forest; PRICE; VALUATION; MODEL;
D O I
10.1016/j.landusepol.2020.104889
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Mass appraisal is a complex matter because it depends on several categorical and continuously changing or constant parameters. In addition, development of new assessment approaches for mass appraisal of real estate properties in highly complex urban environments is desirable. The advancements in geospatial technologies and machine learning algorithms open up new horizons. For this reason, the purpose of the present study is to compare one conventional stepwise linear multiple regression (MRA) and one more automated machine learning approach, random forest (RF), for mass appraisal in an urban residential area where commercial properties are also available. A part of Mamak District, Ankara, Turkey is selected as the study area since the property values are diverse and representative. Additionally, the district has a complex and developing urban structure. The data employed in the study were compiled under a cadastral modernization project of General Directorate of the Land Registry and Cadastre of Turkey (GDLRC) and were based on the reports of licensed experts (similar to 50 %), court reports (similar to 20 %), field surveys, or a combined analysis of all. Consequently, the data used in the study has a high level of confidence. The initial set of parameters used in both methods reflect the most frequently observed characteristics of the real estate properties in the study area that are also effective on the values. The stepwise MRA required manual adjustments of the final parameter set by the expert, whereas RF eliminated unusable parameters fully automatically. The method performance was assessed by using a subset of the training data as a random test. According to the accuracy assessment results, the RF (Adjusted R-2 0.734; the total variance explained from the model) slightly outperforms the MRA (Adjusted R-2 0.696) where the optimal parameters were set by the human expert. Finally, the results exhibited are promising for quick assessment of mass appraisal and a comprehensive discussion is presented in the study.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] A Complex Terrain Simulation Approach Using Ensemble Learning of Random Forest Regression
    Zechun Huang
    Zipu Liu
    Journal of the Indian Society of Remote Sensing, 2022, 50 : 2011 - 2023
  • [32] Prediction and the influencing factor study of colorectal cancer hospitalization costs in China based on machine learning-random forest and support vector regression: a retrospective study
    Gao, Jun
    Liu, Yan
    FRONTIERS IN PUBLIC HEALTH, 2024, 12
  • [33] Process parameters based machine learning model for bead profile prediction in activated TIG Welding using random forest machine learning
    Munghate, Abhinav Arun
    Thapliyal, Shivraman
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 2024, 238 (12) : 1761 - 1768
  • [34] A study on forest fire risk assessment in jiangxi province based on machine learning and geostatistics
    Lu, Jinping
    Li, Mangen
    Qin, Yaozu
    Chen, Niannan
    Wang, Lili
    Yang, Wanzhen
    Song, Yuke
    Zheng, Yisu
    ENVIRONMENTAL RESEARCH COMMUNICATIONS, 2024, 6 (12):
  • [35] Variable Importance Assessment in Regression: Linear Regression versus Random Forest
    Groemping, Ulrike
    AMERICAN STATISTICIAN, 2009, 63 (04): : 308 - 319
  • [36] Wet inrush susceptibility assessment at the Deep Ore Zone mine using a random forest machine learning model
    Ghadirianniari, Sahar
    Mcdougall, Scott
    Eberhardt, Erik
    Varian, Jovian
    Llewelyn, Karl
    Campbell, Ryan
    Moss, Allan
    MINING TECHNOLOGY-TRANSACTIONS OF THE INSTITUTIONS OF MINING AND METALLURGY, 2024, 133 (03) : 276 - 288
  • [37] Forest emissions reduction assessment from airborne LiDAR data using multiple machine learning approaches
    Qin, Shize
    Chen, Yiming
    Yang, Bo
    Zhu, Kaiwei
    FRONTIERS IN ENERGY RESEARCH, 2023, 11
  • [38] Modeling of Flow-Accelerated Corrosion using Machine Learning: Comparison between Random Forest and Non-linear Regression
    Lee, Gyeong-Geun
    Lee, Eun Hee
    Kim, Sung-Woo
    Kim, Kyung-Mo
    Kim, Dong-Jin
    CORROSION SCIENCE AND TECHNOLOGY-KOREA, 2019, 18 (02): : 61 - 71
  • [39] Attribute-Based Assessment of Lung Nodules in CT Using Support Vector Machine and Random Forest
    Choroba, Beata
    Badura, Pawel
    INFORMATION TECHNOLOGY IN BIOMEDICINE (ITIB 2018), 2019, 762 : 279 - 289
  • [40] An Efficient Comparative Machine Learning-based Metagenomics Binning Technique Via Using Random Forest
    Saghir, Helal
    Megherbi, Dalila B.
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND VIRTUAL ENVIRONMENTS FOR MEASUREMENT SYSTEMS AND APPLICATIONS (CIVEMSA), 2013, : 191 - 196