Accounting for spatial variability with geo-aware random forest: A case study for US major crop mapping

被引：0

作者：

Xie, Yiqun ^{[1
]}

Nhu, Anh N. ^{[1
]}

Song, Xiao-Peng ^{[1
]}

Jia, Xiaowei ^{[2
]}

Skakun, Sergii ^{[1
]}

Li, Haijun ^{[1
]}

Wang, Zhihao ^{[1
]}

机构：

[1] Univ Maryland, 7251 Preinkert Dr, College Pk, MD 20742 USA

[2] Univ Pittsburgh, 210 S Bouquet St, Pittsburgh, PA 15260 USA

来源：

REMOTE SENSING OF ENVIRONMENT | 2025年 / 319卷

基金：

美国国家科学基金会;

关键词：

Random forest; Geo-RF; Spatial variability; Remote sensing; Crop classification; NATIONAL-SCALE; LANDSAT; CLASSIFICATION; PERFORMANCE; REFLECTANCE; SYSTEM;

D O I：

10.1016/j.rse.2024.114585

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Spatial variability has been one of the major challenges for large-area crop monitoring and classification with remote sensing. Recent works on deep learning have introduced spatial transformation methods to automatically partition a heterogeneous region into multiple homogeneous sub-regions during the training process. However, the framework is only designed for deep learning and is not available for other models, e.g., decision tree and random forest, which are frequently the models of choice in many crop mapping products. This paper develops a geo-aware random forest (Geo-RF) model to enable new capabilities to automatically recognize spatial variability during training, partition the space, and learn local models. Specifically, Geo-RF can capture spatial partitions with flexible shapes via an efficient bi-partitioning optimization algorithm. GeoRF also automatically determines the number of partitions needed in a hierarchical manner via statistical tests and builds local RF models along the partitioning process to explicitly address spatial variability and improve classification quality. We used both synthetic and real-world data to evaluate the effectiveness of Geo-RF. First, through the controlled synthetic experiment, Geo-RF demonstrated the ability to capture the artificially-inserted true partition where a different relationship between the inputs and outputs is used. Second, we showed the improvements from Geo-RF using crop classification for five major crops over the contiguous US. The results demonstrated that Geo-RF is able to significantly improve classification performance in sub-regions that are otherwise compromised in a single RF model. For example, the partition around downstream Mississippi for soybean classification led to major improvements for about 0.10-0.25 in F1 scores in the area, and the score increased from 0.57 to 0.82 at certain locations. Similarly, for rice classification, the partition in Arkansas led to F1 scores increasing from 0.59 to 0.88 in local areas. In addition, we evaluated the models under different parameter settings, and the results showed that Geo-RF led to improvements over RF in the vast majority of scenarios (e.g., varying model complexity and training sizes). Computationally, Geo-RF took about one to three times more training time while its execution time during testing was similar to that of RF. Overall, Geo-RF showed the ability to automatically address spatial variability via partitioning optimization, which is an important skill for improving crop classification over heterogeneous geographic areas at large scale. Future research can explore the use of Geo-RF for other geographic regions and applications, interpretable methods to understand the data-driven partitioning, and new designs to further enhance the computational efficiency.

引用

页数：16

共 50 条

[1] Spatial prediction using random forest spatial interpolation with sample augmentation: a case study for precipitation mapping
Sijia, Jiao
Tianjun, Wu
Jiancheng, Luo
Ya'nan, Zhou
Wen, Dong
Changpeng, Wang
Shiying, Dong
EARTH SCIENCE INFORMATICS, 2023, 16 (01) : 863 - 875
[2] Spatial prediction using random forest spatial interpolation with sample augmentation: a case study for precipitation mapping
Jiao Sijia
Wu Tianjun
Luo Jiancheng
Zhou Ya’nan
Dong Wen
Wang Changpeng
Dong Shiying
Earth Science Informatics, 2023, 16 : 863 - 875
[3] Remotely Piloted Aircraft and Random Forest in the Evaluation of the Spatial Variability of Foliar Nitrogen in Coffee Crop
Marin, Diego Bedin
Ferraz, Gabriel Araujo e Silva
Guimaraes, Paulo Henrique Sales
Schwerz, Felipe
Santana, Lucas Santos
Barbosa, Brenon Dienevam Souza
Barata, Rafael Alexandre Pena
Faria, Rafael de Oliveira
Dias, Jessica Ellen Lima
Conti, Leonardo
Rossi, Giuseppe
REMOTE SENSING, 2021, 13 (08)
[4] A random forest-based algorithm for data-intensive spatial interpolation in crop yield mapping
Mariano, Cordoba
Monica, Balzarini
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 184
[5] Spatial Variability of Forest Species: Case Study for Alto Alentejo, Portugal
Coelho, Ana Margarida
Sousa, Adelia M. O.
Goncalves, Ana Cristina
LAND, 2023, 12 (01)
[6] Accounting for soil respiration variability – Case study in a Mediterranean pine-dominated forest
Ottorino-Luca Pantani
Fabrizio Fioravanti
Federico M. Stefanini
Rossella Berni
Giacomo Certini
Scientific Reports, 10
[7] Accounting for soil respiration variability - Case study in a Mediterranean pine-dominated forest
Pantani, Ottorino-Luca
Fioravanti, Fabrizio
Stefanini, Federico M.
Berni, Rossella
Certini, Giacomo
SCIENTIFIC REPORTS, 2020, 10 (01)
[8] Spatial-temporal patterns of features selected using random forests: a case study of corn and soybeans mapping in the US
Liu, Xiaoxuan
Yu, Le
Zhong, Liheng
Hao, Pengyu
Wu, Bo
Wang, Hongshuo
Yu, Chaoqing
Gong, Peng
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2019, 40 (01) : 269 - 283
[9] Spatial mapping of the social value of forest services: A case study of northern Jordan
Al-Assaf, Amani A.
Al-Asmar, Yolla Y.
Johnsen-Harris, Bart D.
Al-Raggad, Marwan M.
JOURNAL OF SUSTAINABLE FORESTRY, 2016, 35 (07) : 469 - 485
[10] High-Resolution Mining-Induced Geo-Hazard Mapping Using Random Forest: A Case Study of Liaojiaping Orefield, Central China
Qin, Yaozu
Cao, Li
Boloorani, Ali Darvishi
Wu, Weicheng
REMOTE SENSING, 2021, 13 (18)

← 1 2 3 4 5 →