A mass appraisal assessment study using machine learning based on multiple regression and random forest

被引：60

作者：

Yilmazer, Seckin ^{[1
,2
]}

Kocaman, Sultan ^{[1
]}

机构：

[1] Hacettepe Univ, Dept Geomat Engn, TR-06800 Ankara, Turkey

[2] Gen Directorate Land Registry & Cadastre, Ankara, Turkey

来源：

LAND USE POLICY | 2020年 / 99卷

关键词：

Real estate valuation; Mass appraisal; Multiple regression analysis; Random Forest; PRICE; VALUATION; MODEL;

D O I：

10.1016/j.landusepol.2020.104889

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Mass appraisal is a complex matter because it depends on several categorical and continuously changing or constant parameters. In addition, development of new assessment approaches for mass appraisal of real estate properties in highly complex urban environments is desirable. The advancements in geospatial technologies and machine learning algorithms open up new horizons. For this reason, the purpose of the present study is to compare one conventional stepwise linear multiple regression (MRA) and one more automated machine learning approach, random forest (RF), for mass appraisal in an urban residential area where commercial properties are also available. A part of Mamak District, Ankara, Turkey is selected as the study area since the property values are diverse and representative. Additionally, the district has a complex and developing urban structure. The data employed in the study were compiled under a cadastral modernization project of General Directorate of the Land Registry and Cadastre of Turkey (GDLRC) and were based on the reports of licensed experts (similar to 50 %), court reports (similar to 20 %), field surveys, or a combined analysis of all. Consequently, the data used in the study has a high level of confidence. The initial set of parameters used in both methods reflect the most frequently observed characteristics of the real estate properties in the study area that are also effective on the values. The stepwise MRA required manual adjustments of the final parameter set by the expert, whereas RF eliminated unusable parameters fully automatically. The method performance was assessed by using a subset of the training data as a random test. According to the accuracy assessment results, the RF (Adjusted R-2 0.734; the total variance explained from the model) slightly outperforms the MRA (Adjusted R-2 0.696) where the optimal parameters were set by the human expert. Finally, the results exhibited are promising for quick assessment of mass appraisal and a comprehensive discussion is presented in the study.

引用

页数：11

共 50 条

[31] A Complex Terrain Simulation Approach Using Ensemble Learning of Random Forest Regression
Zechun Huang
Zipu Liu
Journal of the Indian Society of Remote Sensing, 2022, 50 : 2011 - 2023
[32] Prediction and the influencing factor study of colorectal cancer hospitalization costs in China based on machine learning-random forest and support vector regression: a retrospective study
Gao, Jun
Liu, Yan
FRONTIERS IN PUBLIC HEALTH, 2024, 12
[33] Process parameters based machine learning model for bead profile prediction in activated TIG Welding using random forest machine learning
Munghate, Abhinav Arun
Thapliyal, Shivraman
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 2024, 238 (12) : 1761 - 1768
[34] A study on forest fire risk assessment in jiangxi province based on machine learning and geostatistics
Lu, Jinping
Li, Mangen
Qin, Yaozu
Chen, Niannan
Wang, Lili
Yang, Wanzhen
Song, Yuke
Zheng, Yisu
ENVIRONMENTAL RESEARCH COMMUNICATIONS, 2024, 6 (12):
[35] Variable Importance Assessment in Regression: Linear Regression versus Random Forest
Groemping, Ulrike
AMERICAN STATISTICIAN, 2009, 63 (04): : 308 - 319
[36] Wet inrush susceptibility assessment at the Deep Ore Zone mine using a random forest machine learning model
Ghadirianniari, Sahar
Mcdougall, Scott
Eberhardt, Erik
Varian, Jovian
Llewelyn, Karl
Campbell, Ryan
Moss, Allan
MINING TECHNOLOGY-TRANSACTIONS OF THE INSTITUTIONS OF MINING AND METALLURGY, 2024, 133 (03) : 276 - 288
[37] Forest emissions reduction assessment from airborne LiDAR data using multiple machine learning approaches
Qin, Shize
Chen, Yiming
Yang, Bo
Zhu, Kaiwei
FRONTIERS IN ENERGY RESEARCH, 2023, 11
[38] Modeling of Flow-Accelerated Corrosion using Machine Learning: Comparison between Random Forest and Non-linear Regression
Lee, Gyeong-Geun
Lee, Eun Hee
Kim, Sung-Woo
Kim, Kyung-Mo
Kim, Dong-Jin
CORROSION SCIENCE AND TECHNOLOGY-KOREA, 2019, 18 (02): : 61 - 71
[39] Attribute-Based Assessment of Lung Nodules in CT Using Support Vector Machine and Random Forest
Choroba, Beata
Badura, Pawel
INFORMATION TECHNOLOGY IN BIOMEDICINE (ITIB 2018), 2019, 762 : 279 - 289
[40] An Efficient Comparative Machine Learning-based Metagenomics Binning Technique Via Using Random Forest
Saghir, Helal
Megherbi, Dalila B.
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND VIRTUAL ENVIRONMENTS FOR MEASUREMENT SYSTEMS AND APPLICATIONS (CIVEMSA), 2013, : 191 - 196

← 1 2 3 4 5 →