Improving hedonic housing price models by integrating optimal accessibility indices into regression and random forest analyses

被引:18
|
作者
Rey-Blanco, David [1 ]
Zofio, Jose L. [2 ,3 ]
Gonzalez-Arias, Julio [1 ]
机构
[1] UNED, Dept Business Econ, Paseo Senda Rey 11, Madrid 28040, Spain
[2] UAM, Dept Econ, Francisco Tomas & Valiente 5, Madrid 28049, Spain
[3] Erasmus Res Inst Management EUR, Burgemeester Oudlaan 50, NL-3062 PA Rotterdam, Netherlands
关键词
Hedonic housing price models; Accessibility indices; Spatial dependence; Machine learning; RESIDENTIAL PROPERTY; MASS APPRAISAL; VALUES; VALUATION; MARKET; LAND;
D O I
10.1016/j.eswa.2023.121059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Location indices are key in explaining variation in house prices. However, the definition of comprehensive indices capturing all locational features, along with their efficient and timely calculation, is usually one of the most complex dimensions of house price modeling. Existing difficulties result in partial location specifications, mostly due to three hurdles: (1) there is not a consensus on the best method to construct these indices, (2) what features (variables) to include: labor, demographic, commuting, etc., and (3) its creation requires granular and updated datasets. We introduce a methodology based on computer algorithms to create car and walk accessibility indices that address the previous concerns and capture location interactions among a wide range of variables. The selection of variables is based on an automated search of the best performing utility bearing gravitational accessibility indices for price prediction. Once these optimal indices are obtained, the method applies principal components analysis to secure their orthogonality. Using a unique dataset from a leading real estate portal in Europe, we illustrate and test for the city of Madrid their applicability in several house asking price models that are estimated using regression analysis and random forests (as a representative family of machine learning techniques). The experimental analysis reveals that using the optimal indices results in significant improvements in accuracy, for both regression-based models (13%) and random forests (21.6%), while achieving a substantial reduction in spatial autocorrelation (around 35%). The generated indices are clearly interpretable, which makes them a valuable tool for urban analyses (planning, transportation, sustainability, etc.). Finally, the methodology can be extended to other types of real state (commercial, industrial, etc.) and location (country, region, etc.).
引用
收藏
页数:18
相关论文
共 10 条
  • [1] HEDONIC PRICE MODELS AND INDICES BASED ON BOOSTING APPLIED TO THE DUTCH HOUSING MARKET
    Kagie, Martijn
    Van Wezel, Michiel
    INTELLIGENT SYSTEMS IN ACCOUNTING FINANCE & MANAGEMENT, 2007, 15 (3-4): : 85 - 106
  • [2] How is Location Measured in Housing Valuation? A Systematic Review of Accessibility Specifications in Hedonic Price Models
    Heyman, Axel Viktor
    Law, Stephen
    Pont, Meta Berghauser
    URBAN SCIENCE, 2019, 3 (01)
  • [3] The Construction of Residential Housing Price Indices: A Comparison of Repeat-Sales, Hedonic-Regression, and Hybrid Approaches
    Nancy E. Wallace
    Richard A. Meese
    The Journal of Real Estate Finance and Economics, 1997, 14 (1-2) : 51 - 73
  • [4] The construction of residential housing price indices: A comparison of repeat-sales, hedonic-regression, and hybrid approaches
    Meese, RA
    Wallace, NE
    JOURNAL OF REAL ESTATE FINANCE AND ECONOMICS, 1997, 14 (1-2): : 51 - 73
  • [5] Comparative Models of Price Estimation Using Multiple Linear Regression and Random Forest Methods
    Crosss Sihombing, Denny Jean
    Othernima, Desi C.
    Manurung, Jonson
    Sagala, Jijon Raphita
    ICCoSITE 2023 - International Conference on Computer Science, Information Technology and Engineering: Digital Transformation Strategy in Facing the VUCA and TUNA Era, 2023, : 478 - 483
  • [6] Aggregated Housing Price Predictions with No Information About Structural Attributes-Hedonic Models: Linear Regression and a Machine Learning Approach
    Jaroszewicz, Joanna
    Horynek, Hubert
    LAND, 2024, 13 (11)
  • [7] Seepage and dam deformation analyses with statistical models: support vector regression machine and random forest
    Belmokre, Ahmed
    Mihoubi, Mustapha Kamel
    Santillan, David
    3RD INTERNATIONAL CONFERENCE ON STRUCTURAL INTEGRITY (ICSI 2019), 2019, 17 : 698 - 703
  • [8] Spatial heterogeneity of marginal willingness to pay for air quality in PM2.5: analysis of buyers’ housing price in Beijing through hedonic price, spatial regression, and quantile regression models
    Chao Zhang
    Mimi Xiong
    Xuehui Wei
    Zongmin Lan
    Asia-Pacific Journal of Regional Science, 2023, 7 : 697 - 720
  • [9] Spatial heterogeneity of marginal willingness to pay for air quality in PM2.5: analysis of buyers' housing price in Beijing through hedonic price, spatial regression, and quantile regression models
    Zhang, Chao
    Xiong, Mimi
    Wei, Xuehui
    Lan, Zongmin
    ASIA-PACIFIC JOURNAL OF REGIONAL SCIENCE, 2023, 7 (03) : 697 - 720
  • [10] Predictive modelling of pump noise using multi-linear regression and random-forest models – via optimal data splitting
    John M.M.
    Ganiny S.
    Hanief M.
    International Journal of Simulation and Process Modelling, 2022, 18 (04) : 267 - 283