A mass appraisal assessment study using machine learning based on multiple regression and random forest

被引:51
|
作者
Yilmazer, Seckin [1 ,2 ]
Kocaman, Sultan [1 ]
机构
[1] Hacettepe Univ, Dept Geomat Engn, TR-06800 Ankara, Turkey
[2] Gen Directorate Land Registry & Cadastre, Ankara, Turkey
关键词
Real estate valuation; Mass appraisal; Multiple regression analysis; Random Forest; PRICE; VALUATION; MODEL;
D O I
10.1016/j.landusepol.2020.104889
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Mass appraisal is a complex matter because it depends on several categorical and continuously changing or constant parameters. In addition, development of new assessment approaches for mass appraisal of real estate properties in highly complex urban environments is desirable. The advancements in geospatial technologies and machine learning algorithms open up new horizons. For this reason, the purpose of the present study is to compare one conventional stepwise linear multiple regression (MRA) and one more automated machine learning approach, random forest (RF), for mass appraisal in an urban residential area where commercial properties are also available. A part of Mamak District, Ankara, Turkey is selected as the study area since the property values are diverse and representative. Additionally, the district has a complex and developing urban structure. The data employed in the study were compiled under a cadastral modernization project of General Directorate of the Land Registry and Cadastre of Turkey (GDLRC) and were based on the reports of licensed experts (similar to 50 %), court reports (similar to 20 %), field surveys, or a combined analysis of all. Consequently, the data used in the study has a high level of confidence. The initial set of parameters used in both methods reflect the most frequently observed characteristics of the real estate properties in the study area that are also effective on the values. The stepwise MRA required manual adjustments of the final parameter set by the expert, whereas RF eliminated unusable parameters fully automatically. The method performance was assessed by using a subset of the training data as a random test. According to the accuracy assessment results, the RF (Adjusted R-2 0.734; the total variance explained from the model) slightly outperforms the MRA (Adjusted R-2 0.696) where the optimal parameters were set by the human expert. Finally, the results exhibited are promising for quick assessment of mass appraisal and a comprehensive discussion is presented in the study.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Prediction and the influencing factor study of colorectal cancer hospitalization costs in China based on machine learning-random forest and support vector regression: a retrospective study
    Gao, Jun
    Liu, Yan
    [J]. FRONTIERS IN PUBLIC HEALTH, 2024, 12
  • [32] A Complex Terrain Simulation Approach Using Ensemble Learning of Random Forest Regression
    Huang, Zechun
    Liu, Zipu
    [J]. JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2022, 50 (10) : 2011 - 2023
  • [33] A Complex Terrain Simulation Approach Using Ensemble Learning of Random Forest Regression
    Zechun Huang
    Zipu Liu
    [J]. Journal of the Indian Society of Remote Sensing, 2022, 50 : 2011 - 2023
  • [34] Wet inrush susceptibility assessment at the Deep Ore Zone mine using a random forest machine learning model
    Ghadirianniari, Sahar
    Mcdougall, Scott
    Eberhardt, Erik
    Varian, Jovian
    Llewelyn, Karl
    Campbell, Ryan
    Moss, Allan
    [J]. MINING TECHNOLOGY-TRANSACTIONS OF THE INSTITUTIONS OF MINING AND METALLURGY, 2024,
  • [35] Variable Importance Assessment in Regression: Linear Regression versus Random Forest
    Groemping, Ulrike
    [J]. AMERICAN STATISTICIAN, 2009, 63 (04): : 308 - 319
  • [36] Forest emissions reduction assessment from airborne LiDAR data using multiple machine learning approaches
    Qin, Shize
    Chen, Yiming
    Yang, Bo
    Zhu, Kaiwei
    [J]. FRONTIERS IN ENERGY RESEARCH, 2023, 11
  • [37] Modeling of Flow-Accelerated Corrosion using Machine Learning: Comparison between Random Forest and Non-linear Regression
    Lee, Gyeong-Geun
    Lee, Eun Hee
    Kim, Sung-Woo
    Kim, Kyung-Mo
    Kim, Dong-Jin
    [J]. CORROSION SCIENCE AND TECHNOLOGY-KOREA, 2019, 18 (02): : 61 - 71
  • [38] Attribute-Based Assessment of Lung Nodules in CT Using Support Vector Machine and Random Forest
    Choroba, Beata
    Badura, Pawel
    [J]. INFORMATION TECHNOLOGY IN BIOMEDICINE (ITIB 2018), 2019, 762 : 279 - 289
  • [39] Analysis of Gas Turbine Compressor Degradation Using Random Forest-based Machine Learning Model
    Bang, Myeonghwan
    Kang, Haesu
    Lee, Kyuheon
    Oh, Chansu
    Choi, Woosung
    Park, Gyusang
    Kim, Doosoo
    [J]. TRANSACTIONS OF THE KOREAN SOCIETY OF MECHANICAL ENGINEERS B, 2021, 45 (11) : 605 - 612
  • [40] Faults detection and diagnosis of PV systems based on machine learning approach using random forest classifier
    Amiri, Ahmed Faris
    Oudira, Houcine
    Chouder, Aissa
    Kichou, Sofiane
    [J]. ENERGY CONVERSION AND MANAGEMENT, 2024, 301