Accounting for spatial variability with geo-aware random forest: A case study for US major crop mapping

被引:0
|
作者
Xie, Yiqun [1 ]
Nhu, Anh N. [1 ]
Song, Xiao-Peng [1 ]
Jia, Xiaowei [2 ]
Skakun, Sergii [1 ]
Li, Haijun [1 ]
Wang, Zhihao [1 ]
机构
[1] Univ Maryland, 7251 Preinkert Dr, College Pk, MD 20742 USA
[2] Univ Pittsburgh, 210 S Bouquet St, Pittsburgh, PA 15260 USA
基金
美国国家科学基金会;
关键词
Random forest; Geo-RF; Spatial variability; Remote sensing; Crop classification; NATIONAL-SCALE; LANDSAT; CLASSIFICATION; PERFORMANCE; REFLECTANCE; SYSTEM;
D O I
10.1016/j.rse.2024.114585
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Spatial variability has been one of the major challenges for large-area crop monitoring and classification with remote sensing. Recent works on deep learning have introduced spatial transformation methods to automatically partition a heterogeneous region into multiple homogeneous sub-regions during the training process. However, the framework is only designed for deep learning and is not available for other models, e.g., decision tree and random forest, which are frequently the models of choice in many crop mapping products. This paper develops a geo-aware random forest (Geo-RF) model to enable new capabilities to automatically recognize spatial variability during training, partition the space, and learn local models. Specifically, Geo-RF can capture spatial partitions with flexible shapes via an efficient bi-partitioning optimization algorithm. GeoRF also automatically determines the number of partitions needed in a hierarchical manner via statistical tests and builds local RF models along the partitioning process to explicitly address spatial variability and improve classification quality. We used both synthetic and real-world data to evaluate the effectiveness of Geo-RF. First, through the controlled synthetic experiment, Geo-RF demonstrated the ability to capture the artificially-inserted true partition where a different relationship between the inputs and outputs is used. Second, we showed the improvements from Geo-RF using crop classification for five major crops over the contiguous US. The results demonstrated that Geo-RF is able to significantly improve classification performance in sub-regions that are otherwise compromised in a single RF model. For example, the partition around downstream Mississippi for soybean classification led to major improvements for about 0.10-0.25 in F1 scores in the area, and the score increased from 0.57 to 0.82 at certain locations. Similarly, for rice classification, the partition in Arkansas led to F1 scores increasing from 0.59 to 0.88 in local areas. In addition, we evaluated the models under different parameter settings, and the results showed that Geo-RF led to improvements over RF in the vast majority of scenarios (e.g., varying model complexity and training sizes). Computationally, Geo-RF took about one to three times more training time while its execution time during testing was similar to that of RF. Overall, Geo-RF showed the ability to automatically address spatial variability via partitioning optimization, which is an important skill for improving crop classification over heterogeneous geographic areas at large scale. Future research can explore the use of Geo-RF for other geographic regions and applications, interpretable methods to understand the data-driven partitioning, and new designs to further enhance the computational efficiency.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Mapping spatial variability in shoreline change hotspots from satellite data; a case study in southeast Australia
    Konlechner, Teresa M.
    Kennedy, David M.
    O'Grady, Julian J.
    Leach, Chloe
    Ranasinghe, Roshanka
    Carvalho, Rafael C.
    Luijendijk, Arjen P.
    McInnes, Kathleen L.
    Ierodiaconou, Daniel
    ESTUARINE COASTAL AND SHELF SCIENCE, 2020, 246
  • [22] On the Importance of Training Data Sample Selection in Random Forest Image Classification: A Case Study in Peatland Ecosystem Mapping
    Millard, Koreen
    Richardson, Murray
    REMOTE SENSING, 2015, 7 (07) : 8489 - 8515
  • [23] Modeling the Spatial Distribution of Population Based on Random Forest and Parameter Optimization Methods: A Case Study of Sichuan, China
    Chen, Yunzhou
    Wang, Shumin
    Gu, Ziying
    Yang, Fan
    APPLIED SCIENCES-BASEL, 2024, 14 (01):
  • [24] Feature Selection of Time Series MODIS Data for Early Crop Classification Using Random Forest: A Case Study in Kansas, USA
    Hao, Pengyu
    Zhan, Yulin
    Wang, Li
    Niu, Zheng
    Shakir, Muhammad
    REMOTE SENSING, 2015, 7 (05) : 5347 - 5369
  • [25] Forest Type Classification Based on Integrated Spectral-Spatial-Temporal Features and Random Forest Algorithm-A Case Study in the Qinling Mountains
    Cheng, Kai
    Wang, Juanle
    FORESTS, 2019, 10 (07):
  • [26] Spatial variability of evapotranspiration of old-growth cypress forest using remote sensing - a case study of Chilan Mountain cypress forest in Taiwan
    Wu, Chih-Da
    Cheng, Chi-Chuan
    Chuang, Yung-Chung
    CANADIAN JOURNAL OF FOREST RESEARCH, 2012, 42 (06) : 1060 - 1071
  • [27] Characterization and mapping of photovoltaic solar power plants by Landsat imagery and random forest: A case study in Gansu Province, China
    Wang, Xinxin
    Xiao, Xiangming
    Zhang, Xi
    Ye, Hui
    Dong, Jinwei
    He, Qiang
    Wang, Xubang
    Liu, Jianquan
    Li, Bo
    Wu, Jihua
    JOURNAL OF CLEANER PRODUCTION, 2023, 417
  • [28] Prediction of the spatial distribution of soil arthropods using a random forest model: A case study in Changtu County, Northeast China
    Guo, Xiaoyu
    Bian, Zhenxing
    Wang, Shuai
    Wang, Qiubing
    Zhang, Yufei
    Zhou, Jun
    Lin, Lin
    AGRICULTURE ECOSYSTEMS & ENVIRONMENT, 2020, 292
  • [29] Crop Mapping Based on Temporal and Spatial Sample Migrations: A Case Study Over Three Counties in Heilongjiang Province, Northeast China
    Zuo, Hao-Nan
    Leng, Pei
    Li, Yu-Xuan
    Song, Qian
    Li, Zhao-Liang
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 14630 - 14639
  • [30] Vegetation Mapping with Random Forest Using Sentinel 2 and GLCM Texture Feature-A Case Study for Lousa Region, Portugal
    Mohammadpour, Pegah
    Viegas, Domingos Xavier
    Viegas, Carlos
    REMOTE SENSING, 2022, 14 (18)