Cotton planting area extraction and yield prediction based on Sentinel-2A

被引:0
|
作者
Wang H. [1 ]
Zhang Z. [1 ]
Kang X. [2 ]
Lin J. [3 ]
Yin C. [1 ]
Ma L. [1 ]
Huang C. [2 ]
Lyu X. [1 ]
机构
[1] Agricultural College of Shihezi University, The Key Laboratory of Oasis Eco-agriculture, Xinjiang Production and Construction Group, Shihezi
[2] Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing
[3] Agricultural College of Tarim University, Aral
关键词
cotton; feature selection; Google Earth Engine; planting area; remote sensing; vegetation index; yield;
D O I
10.11975/j.issn.1002-6819.2022.09.022
中图分类号
学科分类号
摘要
Timely and accurate prediction of cotton yield has been one of the most important steps in cotton field management and decision making. The cotton yield can be recognized as a continuous and dynamic process, particularly depending mainly on the canopy structure, biomass, and chlorophyll content. In most previous studies, the vegetation index with the highest correlation can often be used to establish a regression equation with the cotton yield as a prediction model. However, a single vegetation index cannot effectively explain the complexity of cotton yield prediction, leading to the error and instability of the model during the subsequent application. In this study, multiple vegetation indices were selected to explore the promising potential in cotton yield prediction. Cotton in the Mosuwan Reclamation Region of Xinjiang, China was taken as the research object. Sentinel-2A image data was set as the data source in 2020. These remote sensing images were obtained to extract the cotton planting area for the yield prediction by combing with the Google Earth Engine (GEE) cloud platform and machine learning. A two-step grading cotton mapping strategy was adopted to extract the cotton from the images. In the first step, 17 features (12 spectral, 3 vegetation index, and 2 DEM features) were selected to extract the farmland by excluding non-crops. In the second step, three classification methods (Random Forest, RF), Support Vector Machine (SVM), and decision tree (CART)) were used to screen the four indicators (Overall Accuracy (OA), Producer’s Accuracy (PA), User's Accuracy (UA), and Kappa Coefficient (KC)), where the best was selected to extract the cotton farmland. Then, the cotton fields at the florescence, boll, and boll opening stages were extracted from the remote sensing images with eight bands at visible, near-infrared, and red edge ranges. Fourteen vegetation indexes (six canopy structure indexes and eight chlorophyll related indexes) under these bands were calculated via the Sequential Forward Selection (SFS) for different growth stages. SPXY (Sample set Partitioning based on joint X-Y distances) sample classification was selected to divide into the training and prediction set, in terms of the features and the cotton yields. The prediction models of cotton yield were constructed at different growth stages using Partial Least Squares Regression (PLSR). The best growth period was determined to compare the accuracy of the models for the cotton yield prediction. Finally, the independent samples were selected to verify the model. The prediction model was then applied to the extracted cotton planting area map to predict the distribution map of cotton yield. The results show that: 1) the RF was the best classification. The cropland-non cropland classification PA, UA, OA, and KC were 0.92, 0.96, 0.94, and 0.89, respectively, which were significantly better than those of SVM and CART. The PA and UA of cotton-non cotton field classification reached 0.95 and 0.87, respectively, whereas, the OA and KC were 0.92, and 0.83, respectively. The RF classified and actual areas of the cotton field were 60 400, and 64 866.7 hectares, respectively, with a relative error of 6.9 %. 2) The red edge band (B6) was set as the first selected feature in the three growth periods, indicating an excellent correlation with the yield, where the correlation coefficient (0.37, 0.47, and 0.53) increased with the three growth period. 3) The boll stage was the best growth stage for the cotton yield prediction (determination coefficient R2=0.62, root mean square error RMSE=625.5 kg/hm2, relative error RE=8.87%) using PLSR, which was better than that at boll opening (R2=0.51, RMSE=789.45 kg/hm2, RE=11.06%) and florescence (R2=0.48, RMSE=686.4 kg/hm2, RE=9.86%) stages. Consequently, the high-performance computing power was achieved by the GEE and Sentinel-2A image data, further determining the cotton yield prediction model at a regional scale. The finding can provide a new idea for the cotton planting area extraction and yield prediction, particularly for the monitoring of cotton crop growth in precision planting. © 2022 Chinese Society of Agricultural Engineering. All rights reserved.
引用
收藏
页码:205 / 214
页数:9
相关论文
共 49 条
  • [1] Khan M A, Wahid A, Ahmad M, Et al., World Cotton Production and Consumption: An Overview, Cotton Production and Uses
  • [2] Wang Yaomin, Chen Haorui, Chen Junying, Et al., Comparation of rice yield estimation model combining spectral index screening method and statistical regression algorithm, Transactions of the Chinese Society of AgriculturalEngineering (Transactions of the CSAE), 37, 21, pp. 208-216, (2021)
  • [3] Weiss M, Jacob F, Duveiller G., Remote sensing for agricultural applications: A meta-review, Remote Sensing of Environment, 236, (2020)
  • [4] Lin Ying, Shao Huaiyong, Study on optimal time and influencing factors for winter wheat yield prediction in henan based on random forest algorithm, Journal of Triticeae Crops, 40, 7, pp. 874-880, (2020)
  • [5] Deb D, Mandal S, Deb S, Et al., Crop Production Estimation Using Remote Sensing, Geospatial Technologies for Crops and Soils, pp. 229-243, (2021)
  • [6] Zhao Wenliang, He Zhen, He Junping, Et al., Remote sensing estimation for winter wheat yield in Henan based on the MODIS-NDVI data, Geographical Research, 31, 12, pp. 2310-2320, (2012)
  • [7] Ji Z, Pan Y, Zhu X, Et al., Prediction of crop yield using phenological information extracted from remote sensing vegetation index, Sensors, 21, 4, (2021)
  • [8] Liu Huanjun, Meng Linghua, Zhang Xinle, Et al., Estimation model of cotton yield with time series Landsat images, Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 31, 17, pp. 215-220, (2015)
  • [9] Lu Xinxin, Correlation Between Vegetation Index and Yield of Alar Cotton Field Based on Landsat, (2021)
  • [10] Huang Y, Sui R, Thomson S J, Et al., Estimation of cotton yield with varied irrigation and nitrogen treatments using aerial multispectral imagery, International Journal of Agricultural and Biological Engineering, 6, 2, pp. 37-41, (2013)