Machine-learning models for spatially-explicit forecasting of future racial segregation in US cities

被引:4
|
作者
Stepinski, Tomasz F. [1 ]
Dmowska, Anna [2 ]
机构
[1] Univ Cincinnati, Space Informat Lab, Cincinnati, OH 45221 USA
[2] Adam Mickiewicz Univ, Inst Geoecol & Geoinformat, Poznan, Poland
来源
关键词
Spatially-explicit forecasting; Supervised learning; Social computing; Racial segregation; RESIDENTIAL SEGREGATION; AREAL INTERPOLATION; INFORMATION-THEORY; DYNAMIC-MODELS; POPULATION; CENSUS;
D O I
10.1016/j.mlwa.2022.100359
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Residential racial segregation in large US cities is a complex phenomenon with important social, political, and economic ramifications. In this paper, we demonstrate that the prediction of future segregation can be achieved by using an empirical model generated by a machine learning (ML) algorithm. Specifically, we predict a future map of neighborhood types - racial compositions quantized to several archetypes. Within such a framework, the prediction of segregation is tantamount to the prediction of a thematic map of future neighborhood types. An ML model of change is trained on historical changes and used to make predictions. The key predicate of an ML model is the choice of attributes - variables that drive the change. We hypothesize that neighborhood type's change of a spatial unit depends only on its present type and statistics of types in surrounding units. The paper asks and positively answers three questions. Is our hypothesis validated by the results? Does the proposed methodology yield useful predictions? Do our results agree with competing predictions? To answer these questions we train and validate a number of change models using, as the case study, 1990, 2000, 2010, and 2020 US Census Bureau block -level data for Cook County, IL (Chicago). We investigated four different algorithms, Random Forest, Gradient Boosted Trees, Neural Network, and Self -Normalizing Net, and have found that Gradient Boosted Trees (GBT) yields the best predictions. Using the GBT-generated model we make a prediction of residential segregation in Cook County in the year 2030.
引用
收藏
页数:10
相关论文
共 24 条
  • [21] Spatial Models or Random Forest? Evaluating the Use of Spatially Explicit Machine Learning Methods to Predict Employment Density around New Transit Stations in Los Angeles
    Credit, Kevin
    GEOGRAPHICAL ANALYSIS, 2022, 54 (01) : 58 - 83
  • [22] How good are different machine and deep learning models in forecasting the future price of metals? Full sample versus sub-sample
    Varshini, Anu
    Kayal, Parthajit
    Maiti, Moinak
    RESOURCES POLICY, 2024, 92
  • [23] Comparative Performance Assessment of Physical-Based and Data-Driven Machine-Learning Models for Simulating Streamflow: A Case Study in Three Catchments across the US
    Jin, Aohan
    Wang, Quanrong
    Zhan, Hongbin
    Zhou, Renjie
    JOURNAL OF HYDROLOGIC ENGINEERING, 2024, 29 (02)
  • [24] Improvement of Time Forecasting Models Using Machine Learning for Future Pandemic Applications Based on COVID-19 Data 2020-2022
    Hamid, Abdul Aziz K. Abdul
    Nawi, Wan Imanul Aisyah Wan Mohamad
    Lola, Muhamad Safiih
    Mustafa, Wan Azani
    Malik, Siti Madhihah Abdul
    Zakaria, Syerrina
    Aruchunan, Elayaraja
    Zainuddin, Nurul Hila
    Gobithaasan, R. U.
    Abdullah, Mohd Tajuddin
    DIAGNOSTICS, 2023, 13 (06)