A mass appraisal assessment study using machine learning based on multiple regression and random forest

被引:51
|
作者
Yilmazer, Seckin [1 ,2 ]
Kocaman, Sultan [1 ]
机构
[1] Hacettepe Univ, Dept Geomat Engn, TR-06800 Ankara, Turkey
[2] Gen Directorate Land Registry & Cadastre, Ankara, Turkey
关键词
Real estate valuation; Mass appraisal; Multiple regression analysis; Random Forest; PRICE; VALUATION; MODEL;
D O I
10.1016/j.landusepol.2020.104889
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Mass appraisal is a complex matter because it depends on several categorical and continuously changing or constant parameters. In addition, development of new assessment approaches for mass appraisal of real estate properties in highly complex urban environments is desirable. The advancements in geospatial technologies and machine learning algorithms open up new horizons. For this reason, the purpose of the present study is to compare one conventional stepwise linear multiple regression (MRA) and one more automated machine learning approach, random forest (RF), for mass appraisal in an urban residential area where commercial properties are also available. A part of Mamak District, Ankara, Turkey is selected as the study area since the property values are diverse and representative. Additionally, the district has a complex and developing urban structure. The data employed in the study were compiled under a cadastral modernization project of General Directorate of the Land Registry and Cadastre of Turkey (GDLRC) and were based on the reports of licensed experts (similar to 50 %), court reports (similar to 20 %), field surveys, or a combined analysis of all. Consequently, the data used in the study has a high level of confidence. The initial set of parameters used in both methods reflect the most frequently observed characteristics of the real estate properties in the study area that are also effective on the values. The stepwise MRA required manual adjustments of the final parameter set by the expert, whereas RF eliminated unusable parameters fully automatically. The method performance was assessed by using a subset of the training data as a random test. According to the accuracy assessment results, the RF (Adjusted R-2 0.734; the total variance explained from the model) slightly outperforms the MRA (Adjusted R-2 0.696) where the optimal parameters were set by the human expert. Finally, the results exhibited are promising for quick assessment of mass appraisal and a comprehensive discussion is presented in the study.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Indication of Health Status Using Machine Learning Linear Regression and Random Forest
    Asif, Arslan
    Nabeel, Muhammad
    Awan, Mazhar Javed
    Ahsan, Muhammad
    Hannan, Abdul
    Abbas, Shahroz
    [J]. 4TH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING (IC)2, 2021, : 464 - 469
  • [2] Machine Learning Scoring Functions Based on Random Forest and Support Vector Regression
    Ballester, Pedro J.
    [J]. PATTERN RECOGNITION IN BIOINFORMATICS, 2012, 7632 : 14 - 25
  • [3] Prediction of size and mass of pistachio kernels using random Forest machine learning
    Vidyarthi, Sriram K.
    Tiwari, Rakhee
    Singh, Samrendra K.
    Xiao, Hong-Wei
    [J]. JOURNAL OF FOOD PROCESS ENGINEERING, 2020, 43 (09)
  • [4] A mass appraisal assessment study of land values using spatial analysis and multiple regression analysis model (MRA)
    Velumani, P.
    Priyadharshini, B.
    Mukilan, K.
    Shanmugapriya
    [J]. MATERIALS TODAY-PROCEEDINGS, 2022, 66 : 2614 - 2625
  • [5] Land subsidence susceptibility assessment using random forest machine learning algorithm
    Majid Mohammady
    Hamid Reza Pourghasemi
    Mojtaba Amiri
    [J]. Environmental Earth Sciences, 2019, 78
  • [6] Land subsidence susceptibility assessment using random forest machine learning algorithm
    Mohammady, Majid
    Pourghasemi, Hamid Reza
    Amiri, Mojtaba
    [J]. ENVIRONMENTAL EARTH SCIENCES, 2019, 78 (16)
  • [7] Machine Learning and Risk Assessment: Random Forest Does Not Outperform Logistic Regression in the Prediction of Sexual Recidivism
    Etzler, Sonja
    Schonbrodt, Felix D.
    Pargent, Florian
    Eher, Reinhard
    Rettenberger, Martin
    [J]. ASSESSMENT, 2024, 31 (02) : 460 - 481
  • [8] Machine learning prediction of the mechanical properties of γ-TiAl alloys produced using random forest regression model
    Kwak, Seungmi
    Kim, Jaehwang
    Ding, Hongsheng
    Xu, Xuesong
    Chen, Ruirun
    Guo, Jingjie
    Fu, Hengzhi
    [J]. JOURNAL OF MATERIALS RESEARCH AND TECHNOLOGY-JMR&T, 2022, 18 : 520 - 530
  • [9] Using a cohort study of diabetes and peripheral artery disease to compare logistic regression and machine learning via random forest modeling
    Andrea M. Austin
    Niveditta Ramkumar
    Barbara Gladders
    Jonathan A. Barnes
    Mark A. Eid
    Kayla O. Moore
    Mark W. Feinberg
    Mark A. Creager
    Marc Bonaca
    Philip P. Goodney
    [J]. BMC Medical Research Methodology, 22
  • [10] Learning-Based Colorization of Grayscale Aerial Images Using Random Forest Regression
    Seo, Dae Kyo
    Kim, Yong Hyun
    Eo, Yang Dam
    Park, Wan Yong
    [J]. APPLIED SCIENCES-BASEL, 2018, 8 (08):