Using multiple linear regression and random forests to identify spatial poverty determinants in rural China

被引:46
|
作者
Liu, Mengxiao [1 ,2 ]
Hu, Shan [1 ]
Ge, Yong [1 ,2 ]
Heuvelink, Gerard B. M. [3 ,4 ]
Ren, Zhoupeng [1 ]
Huang, Xiaoran [1 ]
机构
[1] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, State Key Lab Resources & Environm Informat Syst, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Wageningen Univ, Soil Geog & Landscape Grp, POB 47, NL-6700 AA Wageningen, Netherlands
[4] ISRIC World Soil Informat, POB 353, NL-6700 AJ Wageningen, Netherlands
关键词
Poverty; Spatial; Determinants; LMG; Random forest; Variable importance; RELATIVE IMPORTANCE; VARIABLE IMPORTANCE; ALLEVIATION; IDENTIFICATION; PREDICTORS; INDICATORS; EROSION; ACCESS; SLOPE;
D O I
10.1016/j.spasta.2020.100461
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Identifying poverty determinants in a region is crucial for taking effective poverty reduction measures. This paper utilizes two variable importance analysis methods to identify the relative importance of different geographic factors to explain the spatial distribution of poverty: the Lindeman, Merenda, and Gold (LMG) method used in multiple linear regression (MLR) and variable importance used in random forest (RF) machine learning. A case study was conducted in Yunyang, a poverty-stricken county in China, to evaluate the performances of the two methods for identifying village-level poverty determinants. The results indicated that: (1) MLR and RF had similar explanation accuracy; (2) LMG and RF were consistent in the three main determinants of poverty; (3) LMG better identified the importance of variables that were highly related to poverty but correlated with other variables, while RF better identified the non-linear relationships between poverty and explanatory variables; (4) accessibility metrics are the most important variables influencing poverty in Yunyang and have a linear relationship with poverty. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] PREDICTION SPATIAL PATTERNS OF WINDTHROW PHENOMENON IN DECIDUOUS TEMPERATE FORESTS USING LOGISTIC REGRESSION AND RANDOM FOREST
    Shabani, Saeid
    Akbarinia, Moslem
    [J]. CERNE, 2017, 23 (03) : 387 - 394
  • [42] Social Deprivation and Rural Public Health in China: Exploring the Relationship Using Spatial Regression
    Heyuan You
    Deshao Zhou
    Shenyan Wu
    Xiaowei Hu
    Chenmeng Bie
    [J]. Social Indicators Research, 2020, 147 : 843 - 864
  • [43] Social Deprivation and Rural Public Health in China: Exploring the Relationship Using Spatial Regression
    You, Heyuan
    Zhou, Deshao
    Wu, Shenyan
    Hu, Xiaowei
    Bie, Chenmeng
    [J]. SOCIAL INDICATORS RESEARCH, 2020, 147 (03) : 843 - 864
  • [44] Multiple trait and random regression models using linear splines for genetic evaluation of multiple breed populations
    Ribeiro, V. M. P.
    Raidan, F. S. S.
    Barbosa, A. R.
    Silva, M. V. G. B.
    Cardoso, F. F.
    Toral, F. L. B.
    [J]. JOURNAL OF DAIRY SCIENCE, 2019, 102 (01) : 464 - 475
  • [45] Comparative Models of Price Estimation Using Multiple Linear Regression and Random Forest Methods
    Crosss Sihombing, Denny Jean
    Othernima, Desi C.
    Manurung, Jonson
    Sagala, Jijon Raphita
    [J]. ICCoSITE 2023 - International Conference on Computer Science, Information Technology and Engineering: Digital Transformation Strategy in Facing the VUCA and TUNA Era, 2023, : 478 - 483
  • [46] Using spatial multiple regression to identify intrinsic connectivity networks involved in working memory performance
    Gordon, Evan M.
    Stollstorff, Melanie
    Vaidya, Chandan J.
    [J]. HUMAN BRAIN MAPPING, 2012, 33 (07) : 1536 - 1552
  • [47] Spatial Interpolation using Multiple Regression
    Ohashi, Orlando
    Torgo, Luis
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 1044 - 1049
  • [48] MEG analysis with spatial filter and multiple linear regression
    Okawa, S
    Honda, S
    [J]. SICE 2004 ANNUAL CONFERENCE, VOLS 1-3, 2004, : 1453 - 1457
  • [49] A new approach in adsorption modeling using random forest regression, Bayesian multiple linear regression, and multiple linear regression: 2,4-D adsorption by a green adsorbent
    Beigzadeh, Bahareh
    Bahrami, Mehdi
    Amiri, Mohammad Javad
    Mahmoudi, Mohammad Reza
    [J]. WATER SCIENCE AND TECHNOLOGY, 2020, 82 (08) : 1586 - 1602
  • [50] Water and poverty in rural China: Developing an instrument to assess the multiple dimensions of water and poverty
    Cohen, Alasdair
    Sullivan, Caroline A.
    [J]. ECOLOGICAL ECONOMICS, 2010, 69 (05) : 999 - 1009