Accounting for spatial variability with geo-aware random forest: A case study for US major crop mapping

被引:0
|
作者
Xie, Yiqun [1 ]
Nhu, Anh N. [1 ]
Song, Xiao-Peng [1 ]
Jia, Xiaowei [2 ]
Skakun, Sergii [1 ]
Li, Haijun [1 ]
Wang, Zhihao [1 ]
机构
[1] Univ Maryland, 7251 Preinkert Dr, College Pk, MD 20742 USA
[2] Univ Pittsburgh, 210 S Bouquet St, Pittsburgh, PA 15260 USA
基金
美国国家科学基金会;
关键词
Random forest; Geo-RF; Spatial variability; Remote sensing; Crop classification; NATIONAL-SCALE; LANDSAT; CLASSIFICATION; PERFORMANCE; REFLECTANCE; SYSTEM;
D O I
10.1016/j.rse.2024.114585
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Spatial variability has been one of the major challenges for large-area crop monitoring and classification with remote sensing. Recent works on deep learning have introduced spatial transformation methods to automatically partition a heterogeneous region into multiple homogeneous sub-regions during the training process. However, the framework is only designed for deep learning and is not available for other models, e.g., decision tree and random forest, which are frequently the models of choice in many crop mapping products. This paper develops a geo-aware random forest (Geo-RF) model to enable new capabilities to automatically recognize spatial variability during training, partition the space, and learn local models. Specifically, Geo-RF can capture spatial partitions with flexible shapes via an efficient bi-partitioning optimization algorithm. GeoRF also automatically determines the number of partitions needed in a hierarchical manner via statistical tests and builds local RF models along the partitioning process to explicitly address spatial variability and improve classification quality. We used both synthetic and real-world data to evaluate the effectiveness of Geo-RF. First, through the controlled synthetic experiment, Geo-RF demonstrated the ability to capture the artificially-inserted true partition where a different relationship between the inputs and outputs is used. Second, we showed the improvements from Geo-RF using crop classification for five major crops over the contiguous US. The results demonstrated that Geo-RF is able to significantly improve classification performance in sub-regions that are otherwise compromised in a single RF model. For example, the partition around downstream Mississippi for soybean classification led to major improvements for about 0.10-0.25 in F1 scores in the area, and the score increased from 0.57 to 0.82 at certain locations. Similarly, for rice classification, the partition in Arkansas led to F1 scores increasing from 0.59 to 0.88 in local areas. In addition, we evaluated the models under different parameter settings, and the results showed that Geo-RF led to improvements over RF in the vast majority of scenarios (e.g., varying model complexity and training sizes). Computationally, Geo-RF took about one to three times more training time while its execution time during testing was similar to that of RF. Overall, Geo-RF showed the ability to automatically address spatial variability via partitioning optimization, which is an important skill for improving crop classification over heterogeneous geographic areas at large scale. Future research can explore the use of Geo-RF for other geographic regions and applications, interpretable methods to understand the data-driven partitioning, and new designs to further enhance the computational efficiency.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Spatial and long-term variability of soil loss due to crop harvesting and the importance relative to water erosion: A case study from Belgium
    Ruysschaert, G.
    Poesen, J.
    Notebaert, B.
    Verstraeten, G.
    Govers, G.
    AGRICULTURE ECOSYSTEMS & ENVIRONMENT, 2008, 126 (3-4) : 217 - 228
  • [42] Intraspecific functional trait variability across different spatial scales: a case study of two dominant trees in Korean pine broadleaved forest
    Tingting Li
    Jian Wu
    Hua Chen
    Lanzhu Ji
    Dapao Yu
    Li Zhou
    Wangming Zhou
    Yuewei Tong
    Yinghua Li
    Limin Dai
    Plant Ecology, 2018, 219 : 875 - 886
  • [43] Intraspecific functional trait variability across different spatial scales: a case study of two dominant trees in Korean pine broadleaved forest
    Li, Tingting
    Wu, Jian
    Chen, Hua
    Ji, Lanzhu
    Yu, Dapao
    Zhou, Li
    Zhou, Wangming
    Tong, Yuewei
    Li, Yinghua
    Dai, Limin
    PLANT ECOLOGY, 2018, 219 (08) : 875 - 886
  • [44] Mapping flood prone and Hazards Areas in rural landscape using landsat images and random forest classification: Case study of Nasia watershed in Ghana
    Ghansah, Benjamin
    Nyamekye, Clement
    Owusu, Seth
    Agyapong, Emmanuel
    COGENT ENGINEERING, 2021, 8 (01):
  • [45] Land-cover mapping using Random Forest classification and incorporating NDVI time-series and texture: a case study of central Shandong
    Jin, Yuhao
    Liu, Xiaoping
    Chen, Yimin
    Liang, Xun
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2018, 39 (23) : 8703 - 8723
  • [46] Impact of climatic variables on the spatial and temporal variability of crop yield and biomass gap in Sub-Saharan Africa-a case study in Central Ghana
    Srivastava, Amit Kumar
    Mboh, Cho Miltin
    Gaiser, Thomas
    Ewert, Frank
    FIELD CROPS RESEARCH, 2017, 203 : 33 - 46
  • [47] Enhancing spatial resolution of drought monitoring through a novel random forest-based GRACE drought index: a case study in Central Yunnan
    Wang, Xia
    Zheng, Wei
    Yin, Wenjie
    Xu, Keke
    Zhang, Hebing
    Lei, Weiwei
    GEOCARTO INTERNATIONAL, 2024, 39 (01)
  • [48] Selective logging mapping in the Brazilian Amazon using high spatial resolution planet imagery and artificial intelligence: A case study in the Jamari National Forest
    Braga, Daniel
    Dalagnol, Ricardo
    Ribeiro, Celso B.M.
    Anderson, Liana O.
    Aragão, Luiz E.O.C.
    Proceedings of the Brazilian Symposium on GeoInformatics, 2022, : 204 - 210
  • [49] Application of GIS-based data driven random forest and maximum entropy models for groundwater potential mapping: A case study at Mehran Region, Iran
    Rahmati, Omid
    Pourghasemi, Hamid Reza
    Melesse, Assefa M.
    CATENA, 2016, 137 : 360 - 372
  • [50] Geo-structural Analysis Accompanied by GIS Vulnerability Mapping Validated by Hydro-chemical Modeling in Determining Spatial Expansion of Landfills: Case Study from Jordan
    Al-Farajat, Mohammed
    Diabat, Abdullah
    Al-Adamat, Rida
    Al-Amoush, Hani
    JORDAN JOURNAL OF CIVIL ENGINEERING, 2016, 10 (03) : 367 - 389