Home price index: A machine learning methodology

被引:7
|
作者
Barr J.R. [1 ]
Ellis E.A. [1 ]
Kassab A. [1 ]
Redfearn C.L. [2 ]
Srinivasan N.N. [1 ]
Voris K.B. [1 ]
机构
[1] Department of Data Science, HomeUnion 2010 Main St 250, Irvine, 92614, CA
[2] Department of Public Policy, University of Southern California, Los Angeles, 90089, CA
来源
| 1600年 / World Scientific卷 / 11期
关键词
big data; gradient boosting; Home price index; machine learning; median sales; real estate; regression trees; repeat sales; spatial-Temporal valuation; spline smoothing;
D O I
10.1142/S1793351X17500015
中图分类号
学科分类号
摘要
Estimating house prices is essential for homeowners and investors alike with both needing to understand the value of their asset, and to understand real estate assets as part of an overall portfolios. Commonly-used indices like the National Association of Realtors (NAR) median home price index, or the celebrated Case-Shiller Home Price Index are reported exclusively over a large geographic areas, i.e., a metropolitan, whereby home price dynamics are lost. In this paper, we propose a improved method to capture price dynamics over time at the most granular level possible-A single home. Using over 16 years of home sale data, from the year 2000 to 2016, we estimate home price index for each house. Once home price dynamics is captured, its possible to aggregate price dynamics to construct a price index over geographies of any kind, e.g., ZIP code. This particular index relies on a so-called 'gradient boosted' model, a methodology framework relying on multiple calibration parameters and heavily dependent on sampling techniques. We demonstrate that this approach offers several strengths compared to the commonly reported indices, the 'median sale' and 'repeat sales' indices. © 2017 World Scientific Publishing Company.
引用
收藏
页码:111 / 133
页数:22
相关论文
共 50 条