Random Forest Variable Importance Measures for Spatial Dynamics: Case Studies from Urban Demography

被引:2
|
作者
Georgati, Marina [1 ]
Hansen, Henning Sten [1 ]
Kessler, Carsten [1 ,2 ]
机构
[1] Aalborg Univ, Dept Sustainabil & Planning, DK-2450 Copenhagen, Denmark
[2] Bochum Univ Appl Sci, Dept Geodesy, D-44801 Bochum, Germany
基金
欧盟地平线“2020”;
关键词
residential distribution; machine learning; population dynamics; gridded data; GeoAI; RESIDENTIAL LOCATION CHOICE; NON-EUROPEAN MIGRANTS; ETHNIC SEGREGATION; NEIGHBORHOOD CONCENTRATION; NETHERLANDS; PATTERNS; FAMILY; MODEL; REPRESENTATION; ASSIMILATION;
D O I
10.3390/ijgi12110460
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Population growth in urban centres and the intensification of segregation phenomena associated with international mobility require improved urban planning and decision-making. More effective planning in turn requires better analysis and geospatial modelling of residential locations, along with a deeper understanding of the factors that drive the spatial distribution of various migrant groups. This study examines the factors that influence the distribution of migrants at the local level and evaluates their importance using machine learning, specifically the variable importance measures produced by the random forest algorithm. It is conducted on high spatial resolution ( 100 x 100 grid cells) register data in Amsterdam and Copenhagen, using demographic, housing and neighbourhood attributes for 2018. The results distinguish the ethnic and demographic composition of a location as an important factor in the residential distribution of migrants in both cities. We also examine whether certain migrant groups pay higher prices in the most attractive areas, using spatial statistics and mapping for 2008 and 2018. We find evidence of segregation in both cities, with Western migrants having higher purchasing power than non-Western migrants in both years. The method sheds light on the determinants of migrant distribution in destination cities and advances our understanding of the application of geospatial artificial intelligence to urban dynamics and population movements.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Empirical characterization of random forest variable importance measures
    Archer, Kelfie J.
    Kirnes, Ryan V.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2008, 52 (04) : 2249 - 2260
  • [2] Letter to the Editor: On the stability and ranking of predictors from random forest variable importance measures
    Nicodemus, Kristin K.
    BRIEFINGS IN BIOINFORMATICS, 2011, 12 (04) : 369 - 373
  • [3] An experimental study of the intrinsic stability of random forest variable importance measures
    Huazhen Wang
    Fan Yang
    Zhiyuan Luo
    BMC Bioinformatics, 17
  • [4] Bias in random forest variable importance measures: Illustrations, sources and a solution
    Strobl, Carolin
    Boulesteix, Anne-Laure
    Zeileis, Achim
    Hothorn, Torsten
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [5] An experimental study of the intrinsic stability of random forest variable importance measures
    Wang, Huazhen
    Yang, Fan
    Luo, Zhiyuan
    BMC BIOINFORMATICS, 2016, 17
  • [6] Bias in random forest variable importance measures: Illustrations, sources and a solution
    Carolin Strobl
    Anne-Laure Boulesteix
    Achim Zeileis
    Torsten Hothorn
    BMC Bioinformatics, 8
  • [8] Dissolved oxygen prediction model based on variable importance measures and random forest: A case study of Shenzhen Bay
    Yang, Ming-Yue
    Mao, Xian-Zhong
    Zhongguo Huanjing Kexue/China Environmental Science, 2022, 42 (08): : 3876 - 3881
  • [9] VARIABLE INTERACTION MEASURES WITH RANDOM FOREST CLASSIFIERS
    Kelly, Cassidy
    Okada, Kazunori
    2012 9TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2012, : 154 - 157
  • [10] Estimating neuronal variable importance with random forest
    Oh, J
    Laubach, M
    Luczak, A
    PROCEEDINGS OF THE IEEE 29TH ANNUAL NORTHEAST BIOENGINEERING CONFERENCE, 2003, : 33 - 34