A novel intelligence approach based active and ensemble learning for agricultural soil organic carbon prediction using multispectral and SAR data fusion

被引:67
|
作者
Thu Thuy Nguyen [1 ]
Tien Dat Pham [2 ,4 ]
Chi Trung Nguyen [3 ]
Delfos, Jacob [4 ]
Archibald, Robert [4 ]
Kinh Bac Dang [5 ]
Ngoc Bich Hoang [6 ]
Guo, Wenshan [1 ]
Huu Hao Ngo [1 ,6 ]
机构
[1] Univ Technol Sydney, Ctr Technol Water & Wastewater, Sch Civil & Environm Engn, Sydney, NSW 2007, Australia
[2] Macquarie Univ, Dept Earth & Environm Sci, N Ryde, NSW 2109, Australia
[3] Univ New England, Fac Sci Agr Business & Law, UNE Business Sch, Elm Ave, Armidale, NSW 2351, Australia
[4] Astron Environm Serv, 129 Royal St, East Perth, WA 6004, Australia
[5] VNU Univ Sci, Fac Geog, 334 Nguyen Trai, Hanoi, Vietnam
[6] Nguyen Tat Thanh Univ, Inst Environm Sci, Ho Chi Minh City, Vietnam
关键词
SOC; Machine learning; Multi-sensor data fusion; Sentinel; 1; 2; SENTINEL-2; VEGETATION; INDEXES; AIRBORNE; TEXTURE; BANDS; RED;
D O I
10.1016/j.scitotenv.2021.150187
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Monitoring agricultural soil organic carbon (SOC) has played an essential role in sustainable agricultural management. Precise and robust prediction of SOC greatly contributes to carbon neutrality in the agricultural industry. To create more knowledge regarding the ability of remote sensing to monitor carbon soil, this research devises a state-of-the-art low cost machine learning model for quantifying agricultural soil carbon using active and ensemble-based decision tree learning combined with multi-sensor data fusion at a national and world scale. This work explores the use of Sentinel-1 (S1) C-band dual polarimetric synthetic aperture radar (SAR), Sentinel-2 (S2) multispectral data, and an innovative machine learning (ML) approach using an integration of active learning for land-use mapping and advanced Extreme Gradient Boosting (XGBoost) for robustness of the SOC estimates. The collected soil samples from a field survey in Western Australia were used for the model validation. The indicators including the coefficient of determination (R-2) and root - mean - square - error (RMSE) were applied to evaluate the model's performance. A numerous features computed from optical and SAR data fusion were employed to build and test the proposed model performance. The effectiveness of the proposed machine learning model was assessed by comparing with the two well-known algorithms such as Random Forests (RF) and Support Vector Machine (SVM) to predict agricultural SOC. Results suggest that a combination of S1 and S2 sensors could effectively estimate SOC in farming areas by using ML techniques. Satisfactory accuracy of the proposed XGBoost with optimal features was achieved the highest performance (R-2 = 0.870; RMSE - 1.818 tonC/ha) which outperformed RF and SVM. Thus, multi-sensor data fusion combined with the XGBoost lead to the best prediction results for agricultural SOC at 10 m spatial resolution. In short, this new approach could significantly contribute to various agricultural SOC retrieval studies globally. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] A novel ensemble approach for heterogeneous data with active learning
    Salama, Mohamed
    Abdelkader, Hatem
    Abdelwahab, Amira
    [J]. INTERNATIONAL JOURNAL OF ENGINEERING BUSINESS MANAGEMENT, 2022, 14
  • [2] A novel ensemble learning approach to extract urban impervious surface based on machine learning algorithms using SAR and optical data
    Ahmad, Muhammad Nasar
    Shao, Zhenfeng
    Xiao, Xiongwu
    Fu, Peng
    Javed, Akib
    Ara, Iffat
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 132
  • [3] Machine Learning for Soil Moisture Prediction Using Hyperspectral and Multispectral Data
    Lobato, Michaela
    Norris, William Robert
    Nagi, Rakesh
    Soylemezoglu, Ahmet
    Nottage, Dustin
    [J]. 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2021, : 696 - 702
  • [4] Dynamic ensemble approach for estimating organic carbon using computational intelligence
    Spencer, Matthew J.
    Whitfort, Tim
    McCullagh, John
    Bui, Elisabeth
    [J]. PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER SCIENCE AND TECHNOLOGY, 2006, : 186 - +
  • [5] An ensemble-based incremental learning approach to data fusion
    Parikh, Devi
    Polikar, Robi
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02): : 437 - 450
  • [6] Soil Organic Carbon Mapping Using Multispectral Remote Sensing Data: Prediction Ability of Data with Different Spatial and Spectral Resolutions
    Zizala, Daniel
    Minarik, Robert
    Zadorova, Tereza
    [J]. REMOTE SENSING, 2019, 11 (24)
  • [7] A novel data-driven approach for residential electricity consumption prediction based on ensemble learning
    Chen, Kunlong
    Jiang, Jiuchun
    Zheng, Fangdan
    Chen, Kunjin
    [J]. ENERGY, 2018, 150 : 49 - 60
  • [8] A novel approach to map soil organic carbon content using spectroscopic and environmental data
    Rial, Marcela
    Martinez Cortizas, Antonio
    Rodriguez-Lado, Luis
    [J]. SPATIAL STATISTICS CONFERENCE 2015, PART 2, 2015, 27 : 49 - 52
  • [9] A novel data repairing approach based on constraints and ensemble learning
    Ataeyan, Mahdieh
    Daneshpour, Negin
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 159
  • [10] An Active Learning Approach for Ensemble-based Data Stream Mining
    Alabdulrahman, Rabaa
    Viktor, Herna
    Paquet, Eric
    [J]. KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1, 2016, : 275 - 282