Spatial Prediction of Apartment Rent using Regression-Based and Machine Learning-Based Approaches with a Large Dataset

被引:3
|
作者
Yoshida, Takahiro [1 ]
Murakami, Daisuke [2 ]
Seya, Hajime [3 ]
机构
[1] Univ Tokyo, Ctr Spatial Informat Sci, Kashiwa, Chiba, Japan
[2] Inst Stat Math, Dept Stat Data Sci, Tachikawa, Tokyo, Japan
[3] Kobe Univ, Grad Sch Engn, Dept Civil Engn, Kobe, Hyogo, Japan
来源
关键词
Apartment Rent Price Prediction; Large Data; Nearest Neighbor Gaussian Processes (NNGP); Deep Neural Network (DNN); Extreme Gradient Boosting (XGBoost); Random Forest (RF); NEURAL-NETWORK; FRAMEWORK; STATISTICS; MODELS;
D O I
10.1007/s11146-022-09929-6
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Employing a large dataset (at most, the order of n = 10(6)), this study attempts enhance the literature on the comparison between regression and machine learning-based rent price prediction models by adding new empirical evidence and considering the spatial dependence of the observations. The regression-based approach incorporates the nearest neighbor Gaussian processes (NNGP) model, enabling the application of kriging to large datasets. In contrast, the machine learning-based approach utilizes typical models: extreme gradient boosting (XGBoost), random forest (RF), and deep neural network (DNN). The out-of-sample prediction accuracy of these models was compared using Japanese apartment rent data, with a varying order of sample sizes (i.e., n = 10(4), 10(5), 10(6)). The results showed that, as the sample size increased, XGBoost and RF outperformed NNGP with higher out-of-sample prediction accuracy. XGBoost achieved the highest prediction accuracy for all sample sizes and error measures in both logarithmic and real scales and for all price bands if the distribution of rents is similar in training and test data. A comparison of several methods to account for the spatial dependence in RF showed that simply adding spatial coordinates to the explanatory variables may be sufficient.
引用
收藏
页码:1 / 28
页数:28
相关论文
共 50 条
  • [1] Machine learning-based approaches for cancer prediction using microbiome data
    Freitas, Pedro
    Silva, Francisco
    Sousa, Joana Vale
    Ferreira, Rui M.
    Figueiredo, Ceu
    Pereira, Tania
    Oliveira, Helder P.
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01):
  • [2] Machine learning-based approaches for cancer prediction using microbiome data
    Pedro Freitas
    Francisco Silva
    Joana Vale Sousa
    Rui M. Ferreira
    Céu Figueiredo
    Tania Pereira
    Hélder P. Oliveira
    [J]. Scientific Reports, 13 (1)
  • [3] Machine learning-based approaches for disease gene prediction
    Duc-Hau Le
    [J]. BRIEFINGS IN FUNCTIONAL GENOMICS, 2020, 19 (5-6) : 350 - 363
  • [4] A regression-based machine learning approach for the prediction of lung function decline
    Poulou, Angeliki
    Poulos, Marios
    Panas, Maximilianos
    [J]. 2022 12TH INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS, SERVICES AND TECHNOLOGIES (DESSERT), 2022,
  • [5] Spatial prediction of demersal fish diversity in the Baltic Sea: comparison of machine learning and regression-based techniques
    Smolinski, Szymon
    Radtke, Krzysztof
    [J]. ICES JOURNAL OF MARINE SCIENCE, 2017, 74 (01) : 102 - 111
  • [6] Probabilistic and machine learning-based retrieval approaches for biomedical dataset retrieval
    Karisani, Payam
    Qin, Zhaohui S.
    Agichtein, Eugene
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2018,
  • [7] Machine learning-based risk prediction model for cardiovascular disease using a hybrid dataset
    Kanagarathinam, Karthick
    Sankaran, Durairaj
    Manikandan, R.
    [J]. DATA & KNOWLEDGE ENGINEERING, 2022, 140
  • [8] Regression-Based Machine Learning Framework for Customer Churn Prediction in Telecommunication Industry
    Ele, Sylvester Igbo
    Alo, Uzoma Rita
    Nweke, Henry Friday
    Ofem, Ajah Ofem
    [J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2023, 14 (05) : 1046 - 1055
  • [9] Residential Load Forecasting for Flexibility Prediction Using Machine Learning-Based Regression Model
    Ahmadiahangar, Roya
    Haring, Tobias
    Rosin, Argo
    Korotko, Tarmo
    Martins, Jodo
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2019 IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC / I&CPS EUROPE), 2019,
  • [10] Regression-Based Prediction of Power Generation at Samanalawewa Hydropower Plant in Sri Lanka Using Machine Learning
    Ekanayake, Piyal
    Wickramasinghe, Lasini
    Jayasinghe, J. M. Jeevani W.
    Rathnayake, Upaka
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021