Leaf Area Index Estimation Algorithm for GF-5 Hyperspectral Data Based on Different Feature Selection and Machine Learning Methods

Cited by: 53
Authors
Chen, Zhulin [1, 2]
Jia, Kun [1, 2]
Xiao, Chenchao [3]
Wei, Dandan [3]
Zhao, Xiang [1, 2]
Lan, Jinhui [4, 5]
Wei, Xiangqin [6]
Yao, Yunjun [1, 2]
Wang, Bing [1, 2]
Sun, Yuan [6]
Wang, Lei [7]
Affiliations
[1] Beijing Normal Univ, Fac Geog Sci, State Key Lab Remote Sensing Sci, Beijing 100875, Peoples R China
[2] Beijing Normal Univ, Fac Geog Sci, Beijing Engn Res Ctr Global Land Remote Sensing P, Beijing 100875, Peoples R China
[3] Minist Nat Resource Peoples Republ China, Land Satellite Remote Sensing Applicat Ctr, Beijing 100048, Peoples R China
[4] Beijing Engn Res Ctr Ind Spectrum Imaging, Beijing 100083, Peoples R China
[5] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
[6] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100101, Peoples R China
[7] Ningxia Univ, Northwest Natl Key Lab Breeding Base Land Degrada, Yinchuan 750021, Ningxia, Peoples R China
Keywords
GF-5; LAI; feature selection; machine learning; TEMPERATURE CONDITION INDEX; VEGETATION INDEXES; DIMENSIONALITY REDUCTION; ABOVEGROUND BIOMASS; CHLOROPHYLL CONTENT; NEURAL-NETWORK; LAI ESTIMATION; SENTINEL-2; REGRESSION; INVERSION;
DOI
10.3390/rs12132110
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Leaf area index (LAI) is an essential vegetation parameter that represents light energy utilization and vegetation canopy structure. As the only in-operation hyperspectral satellite launched by China, GF-5 is potentially useful for accurate LAI estimation. However, no research has focused on evaluating GF-5 data for LAI estimation. Hyperspectral remote sensing data contain abundant information about the reflective characteristics of vegetation canopies, but such abundant data also easily lead to the curse of dimensionality. Therefore, feature selection (FS) is necessary to reduce data redundancy and achieve more reliable estimates. Currently, machine learning (ML) algorithms are widely used for FS, and the same ML algorithm is usually used for both FS and regression in LAI estimation. However, no evidence suggests that this is the optimal solution. Therefore, this study evaluates the capacity of GF-5 spectral reflectance for estimating LAI and the performance of different combinations of FS and ML algorithms. First, the PROSAIL model, which couples the PROSPECT leaf optical properties model with the scattering by arbitrarily inclined leaves (SAIL) model, was used to generate simulated GF-5 reflectance data under different vegetation and soil conditions. Then three FS methods, namely random forest (RF), K-means clustering (K-means) and mean impact value (MIV), and three ML algorithms, namely random forest regression (RFR), back propagation neural network (BPNN) and K-nearest neighbor (KNN), were combined to develop nine LAI estimation models. The FS process was conducted in two stages using different strategies. In the first stage, the three FS methods were used to find the smallest number of bands that maintained the estimation accuracy obtained with all bands. In the second stage, the sequential backward selection (SBS) method was used to eliminate the bands that had minimal impact on LAI estimation accuracy.
Finally, the three best estimation models were selected and evaluated using reference LAI. The results showed that although the RF_RFR model (RF used for feature selection and RFR used for regression) achieved reliable LAI estimates (coefficient of determination (R²) = 0.828, root mean square error (RMSE) = 0.839), the poor performance (R² = 0.763, RMSE = 0.987) of the MIV_BPNN model (MIV used for feature selection and BPNN used for regression) suggested that using the same ML algorithm for both feature selection and regression does not always ensure an optimal estimate. Moreover, RF selection preserved the most informative bands for LAI estimation, so that each ML regression method could achieve satisfactory estimation results. Finally, the results indicated that the RF_KNN model (RF used for feature selection and KNN used for regression) with the reflectance of seven GF-5 spectral bands achieved better estimation results than the other models when validated with simulated data (R² = 0.834, RMSE = 0.824) and with actual reference LAI (R² = 0.659, RMSE = 0.697).
Pages: 23
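The second FS stage and the accuracy metrics named in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it skips the RF, K-means and MIV stage entirely, uses a hand-written KNN regressor instead of a library, and replaces PROSAIL-simulated GF-5 reflectance with six random toy "bands" of which only two carry signal. All function names, the band count, `k`, and the toy target are assumptions made for illustration.

```python
import math
import random


def rmse(y_true, y_pred):
    """Root mean square error between reference and estimated values."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def knn_predict(X_train, y_train, x, bands, k=5):
    """Average the targets of the k training samples nearest to x,
    measuring squared Euclidean distance on the selected bands only."""
    dists = sorted(
        (sum((row[b] - x[b]) ** 2 for b in bands), y)
        for row, y in zip(X_train, y_train)
    )
    return sum(y for _, y in dists[:k]) / k


def holdout_rmse(X_tr, y_tr, X_va, y_va, bands, k=5):
    """Validation RMSE of KNN regression restricted to the given bands."""
    preds = [knn_predict(X_tr, y_tr, x, bands, k) for x in X_va]
    return rmse(y_va, preds)


def sequential_backward_selection(X_tr, y_tr, X_va, y_va, n_keep, k=5):
    """Start from all bands; repeatedly drop the band whose removal
    gives the lowest validation RMSE, until n_keep bands remain."""
    bands = list(range(len(X_tr[0])))
    while len(bands) > n_keep:
        least_useful = min(
            bands,
            key=lambda b: holdout_rmse(
                X_tr, y_tr, X_va, y_va, [c for c in bands if c != b], k
            ),
        )
        bands.remove(least_useful)
    return bands


# Toy data (assumption): 6 random "bands"; only bands 0 and 1 drive the target.
rng = random.Random(0)


def make_samples(n):
    X = [[rng.random() for _ in range(6)] for _ in range(n)]
    y = [6 * row[0] + 2 * row[1] for row in X]  # stand-in for LAI
    return X, y


X_tr, y_tr = make_samples(120)
X_va, y_va = make_samples(40)

selected = sequential_backward_selection(X_tr, y_tr, X_va, y_va, n_keep=2)
preds = [knn_predict(X_tr, y_tr, x, selected) for x in X_va]
print("selected bands:", selected)
print("R2 =", round(r2(y_va, preds), 3), "RMSE =", round(rmse(y_va, preds), 3))
```

In this sketch the band to drop is chosen on a single hold-out split; a study-grade pipeline would more likely use cross-validation and a separate test set, so the printed scores are only indicative of the toy setup.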