Accounting for spatial variability with geo-aware random forest: A case study for US major crop mapping

Authors
Xie, Yiqun [1]
Nhu, Anh N. [1]
Song, Xiao-Peng [1]
Jia, Xiaowei [2]
Skakun, Sergii [1]
Li, Haijun [1]
Wang, Zhihao [1]
Affiliations
[1] Univ Maryland, 7251 Preinkert Dr, College Pk, MD 20742 USA
[2] Univ Pittsburgh, 210 S Bouquet St, Pittsburgh, PA 15260 USA
Funding
US National Science Foundation
Keywords
Random forest; Geo-RF; Spatial variability; Remote sensing; Crop classification; NATIONAL-SCALE; LANDSAT; CLASSIFICATION; PERFORMANCE; REFLECTANCE; SYSTEM;
DOI
10.1016/j.rse.2024.114585
Chinese Library Classification (CLC) number
X [Environmental science, safety science]
Discipline classification code
08; 0830
Abstract
Spatial variability has been one of the major challenges for large-area crop monitoring and classification with remote sensing. Recent work on deep learning has introduced spatial transformation methods to automatically partition a heterogeneous region into multiple homogeneous sub-regions during the training process. However, the framework is only designed for deep learning and is not available for other models, e.g., decision trees and random forests, which are frequently the models of choice in many crop mapping products. This paper develops a geo-aware random forest (Geo-RF) model to enable new capabilities to automatically recognize spatial variability during training, partition the space, and learn local models. Specifically, Geo-RF can capture spatial partitions with flexible shapes via an efficient bi-partitioning optimization algorithm. Geo-RF also automatically determines the number of partitions needed in a hierarchical manner via statistical tests and builds local RF models along the partitioning process to explicitly address spatial variability and improve classification quality. We used both synthetic and real-world data to evaluate the effectiveness of Geo-RF. First, through the controlled synthetic experiment, Geo-RF demonstrated the ability to capture the artificially inserted true partition where a different relationship between the inputs and outputs is used. Second, we showed the improvements from Geo-RF using crop classification for five major crops over the contiguous US. The results demonstrated that Geo-RF is able to significantly improve classification performance in sub-regions that are otherwise compromised in a single RF model. For example, the partition around the downstream Mississippi for soybean classification led to major improvements of about 0.10-0.25 in F1 score in the area, and the score increased from 0.57 to 0.82 at certain locations. Similarly, for rice classification, the partition in Arkansas led to F1 scores increasing from 0.59 to 0.88 in local areas. In addition, we evaluated the models under different parameter settings, and the results showed that Geo-RF led to improvements over RF in the vast majority of scenarios (e.g., varying model complexity and training sizes). Computationally, Geo-RF took about one to three times more training time while its execution time during testing was similar to that of RF. Overall, Geo-RF showed the ability to automatically address spatial variability via partitioning optimization, which is an important capability for improving crop classification over heterogeneous geographic areas at large scale. Future research can explore the use of Geo-RF for other geographic regions and applications, interpretable methods to understand the data-driven partitioning, and new designs to further enhance the computational efficiency.
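The partition-then-learn-local-models idea in the abstract can be illustrated with a small toy sketch. This is not the authors' Geo-RF algorithm: a one-feature threshold classifier stands in for a random forest, space is a single coordinate in [0, 1), candidate cuts are a fixed grid, and a fixed validation-accuracy gain (`min_gain`) stands in for the statistical test. All names, data, and parameters below are hypothetical and chosen only for illustration; the synthetic labels flip their relationship to the feature at location 0.4, loosely mirroring the inserted-partition experiment the abstract describes.

```python
import random

def predict(model, f):
    """Apply a threshold classifier (toy stand-in for a random forest)."""
    threshold, polarity = model
    return int(f > threshold) if polarity == 1 else int(f <= threshold)

def accuracy(model, rows):
    return sum(predict(model, f) == y for _, f, y in rows) / len(rows)

def fit_stump(rows):
    """Pick the (threshold, polarity) pair with the best training accuracy."""
    candidates = [(i / 20, pol) for i in range(1, 20) for pol in (1, -1)]
    return max(candidates, key=lambda m: accuracy(m, rows))

def geo_partition(train, val, lo=0.0, hi=1.0, depth=0,
                  max_depth=2, min_gain=0.05, min_size=20):
    """Recursively bi-partition [lo, hi) and fit one local model per part.

    A split is kept only if two local models beat the single model on the
    validation rows by at least min_gain (a stand-in for a statistical test).
    Returns a list of (lo, hi, model) tuples.
    """
    model = fit_stump(train)
    base = accuracy(model, val)
    best = None
    if depth < max_depth:
        for k in range(1, 10):
            cut = lo + k * (hi - lo) / 10
            tr_l = [r for r in train if r[0] < cut]
            tr_r = [r for r in train if r[0] >= cut]
            va_l = [r for r in val if r[0] < cut]
            va_r = [r for r in val if r[0] >= cut]
            if min(len(tr_l), len(tr_r), len(va_l), len(va_r)) < min_size:
                continue
            m_l, m_r = fit_stump(tr_l), fit_stump(tr_r)
            score = (accuracy(m_l, va_l) * len(va_l)
                     + accuracy(m_r, va_r) * len(va_r)) / len(val)
            if best is None or score > best[0]:
                best = (score, cut, tr_l, tr_r, va_l, va_r)
    if best is None or best[0] - base < min_gain:
        return [(lo, hi, model)]  # no worthwhile split: keep one model
    _, cut, tr_l, tr_r, va_l, va_r = best
    return (geo_partition(tr_l, va_l, lo, cut, depth + 1,
                          max_depth, min_gain, min_size)
            + geo_partition(tr_r, va_r, cut, hi, depth + 1,
                            max_depth, min_gain, min_size))

def predict_geo(parts, loc, f):
    """Route a sample to the local model of the partition containing loc."""
    for lo, hi, model in parts:
        if lo <= loc < hi:
            return predict(model, f)
    return predict(parts[-1][2], f)

def make_data(n, rng):
    """Synthetic rows (loc, feature, label); the rule flips at loc = 0.4."""
    rows = []
    for _ in range(n):
        loc, f = rng.random(), rng.random()
        y = int(f > 0.5) if loc < 0.4 else int(f <= 0.5)
        rows.append((loc, f, y))
    return rows

rng = random.Random(0)
train, val, test = make_data(1000, rng), make_data(500, rng), make_data(500, rng)

global_model = fit_stump(train)
parts = geo_partition(train, val)

global_acc = accuracy(global_model, test)
geo_acc = sum(predict_geo(parts, loc, f) == y for loc, f, y in test) / len(test)
print("partitions:", [(lo, hi) for lo, hi, _ in parts])
print("single-model accuracy:", global_acc, "partitioned accuracy:", geo_acc)
```

In this toy setting a single model cannot fit both regimes at once, while the partitioned models can each fit their own region. The sketch searches only axis-aligned cuts on one coordinate, whereas the abstract describes partitions with flexible shapes found by a bi-partitioning optimization.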
Pages: 16
Related papers
50 in total
  • [31] Spatially Explicit Mapping of Historical Population Density with Random Forest Regression: A Case Study of Gansu Province, China, in 1820 and 2000
    Wang, Fahao
    Lu, Weidong
    Zheng, Jingyun
    Li, Shicheng
    Zhang, Xuezhen
    SUSTAINABILITY, 2020, 12 (03)
  • [32] Local Population Mapping Using a Random Forest Model Based on Remote and Social Sensing Data: A Case Study in Zhengzhou, China
    Qiu, Ge
    Bao, Yuhai
    Yang, Xuchao
    Wang, Chen
    Ye, Tingting
    Stein, Alfred
    Jia, Peng
    REMOTE SENSING, 2020, 12 (10)
  • [33] Mapping Dryland Ecosystems Using Google Earth Engine and Random Forest: A Case Study of an Ecologically Critical Area in Northern China
    Li, Shuai
    Guo, Pu
    Sun, Fei
    Zhu, Jinlei
    Cao, Xiaoming
    Dong, Xue
    Lu, Qi
    LAND, 2024, 13 (06)
  • [34] Spatial Susceptibility Assessment of Landslides Based on Random Forest: A Case Study from Hubei Section in the Three Gorges Reservoir Area
    Wu R.
    Hu X.
    Mei H.
    He J.
    Yang J.
    Diqiu Kexue - Zhongguo Dizhi Daxue Xuebao/Earth Science - Journal of China University of Geosciences, 2021, 46 (01) : 321 - 330
  • [35] Sentinel-1 and 2 Time-Series for Vegetation Mapping Using Random Forest Classification: A Case Study of Northern Croatia
    Dobrinic, Dino
    Gasparovic, Mateo
    Medak, Damir
    REMOTE SENSING, 2021, 13 (12)
  • [36] MAPPING BENTHIC HABITAT FROM WORLDVIEW-3 IMAGE USING RANDOM FOREST CASE STUDY: NUSA LEMBONGAN, BALI, INDONESIA
    Ginting, Devica Natalia Br
    Wicaksono, Pramaditya
    Farda, Nur Mohammad
    GEOINFORMATION WEEK 2022, VOL. 48-4, 2023, : 123 - 129
  • [37] An innovative method for landslide susceptibility mapping supported by fractal theory, GeoDetector, and random forest: a case study in Sichuan Province, SW China
    Zhuo Chen
    Danqing Song
    Lihu Dong
    Natural Hazards, 2023, 118 : 2543 - 2568
  • [38] An innovative method for landslide susceptibility mapping supported by fractal theory, GeoDetector, and random forest: a case study in Sichuan Province, SW China
    Chen, Zhuo
    Song, Danqing
    Dong, Lihu
    NATURAL HAZARDS, 2023, 118 (03) : 2543 - 2568
  • [39] Geo-spatial approach for land-use and land-cover changes and deforestation mapping: a case study of Ankasha Guagusa, Northwestern, Ethiopia
    Mekasha, Samson Tsegaye
    Suryabhagavan, K. V.
    Gebrehiwot, Mersha
    TROPICAL ECOLOGY, 2020, 61 (04) : 550 - 569
  • [40] Geo-spatial approach for land-use and land-cover changes and deforestation mapping: a case study of Ankasha Guagusa, Northwestern, Ethiopia
    Samson Tsegaye Mekasha
    K. V. Suryabhagavan
    Mersha Gebrehiwot
    Tropical Ecology, 2020, 61 : 550 - 569