A surrogate model based on feature selection techniques and regression learners to improve soybean yield prediction in southern France

被引:23
|
作者
Corrales, David Camilo [1 ,2 ]
Schoving, Celine [3 ]
Raynal, Helene [1 ]
Debaeke, Philippe [1 ]
Journet, Etienne-Pascal [1 ,4 ]
Constantin, Julie [1 ]
机构
[1] Univ Toulouse, INRAE, UMR AGIR, F-31326 Castanet Tolosan, France
[2] Univ Cauca, Grp Ingn Telemat, Popayan, Colombia
[3] Terres Inovia, Baziege, France
[4] Univ Toulouse, INRAE, CNRS, LIPME, F-31326 Castanet Tolosan, France
基金
欧盟地平线“2020”;
关键词
STICS; Regression learners; Filter; Wrapper; Embedded; SOIL-CROP MODEL; CLASSIFICATION; STICS; CALIBRATION; ACCURACY; FILTER; WATER;
D O I
10.1016/j.compag.2021.106578
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Empirical and process-based models are currently used to predict crop yield at field and regional levels. A mechanistic model named STICS (Multidisciplinary Simulator for Standard Crops) has been used to simulate soybean grain yield in several environments, including southern France. STICS simulates at a daily step the effects of climate, soil and management practices on plant growth, development and production. In spite of good performances to predict total aboveground biomass, poor results were obtained for final grain yield. In order to improve yield prediction, a surrogate model was developed from STICS dynamic simulations, feature selection techniques and regression learners. STICS was used to simulate functional variables at given growth stages and over selected phenological phases. The most representative variables were selected through feature selection techniques (filter, wrapper and embedded), and a subset of variables were used to train the regression learners Linear regression (LR), Support vector regression (SVR), Back propagation neural network (BPNN), Random forest (RF), Least Absolute Shrinkage and Selection Operator (LASSO) and M5 decision tree. The subset of variables selected by wrapper method combined with regression models SVR (R2 = 0. 7102; subset of variables = 6) and LR (R2 = 0. 6912; subset of variables = 14) provided the best results. SVR and LR models improved significantly the soybean yield predictions in southern France in comparison to STICS simulations (R2 = 0.040).
引用
收藏
页数:19
相关论文
共 50 条
  • [31] A diabetes prediction model based on Boruta feature selection and ensemble learning
    Zhou, Hongfang
    Xin, Yinbo
    Li, Suli
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [32] Price Prediction of Bitcoin Based on Adaptive Feature Selection and Model Optimization
    Zhu, Yingjie
    Ma, Jiageng
    Gu, Fangqing
    Wang, Jie
    Li, Zhijuan
    Zhang, Youyao
    Xu, Jiani
    Li, Yifan
    Wang, Yiwen
    Yang, Xiangqun
    MATHEMATICS, 2023, 11 (06)
  • [33] Photovoltaic power prediction of LSTM model based on Pearson feature selection
    Chen, Hailang
    Chang, Xianfa
    ENERGY REPORTS, 2021, 7 : 1047 - 1054
  • [34] Photovoltaic power prediction of LSTM model based on Pearson feature selection
    Chen, Hailang
    Chang, Xianfa
    Energy Reports, 2021, 7 : 1047 - 1054
  • [35] An enhanced feature representation based on linear regression model for stock market prediction
    Ihlayyel, Hani A. K.
    Sharef, Nurfadhlina Mohd
    Nazri, Mohd Zakree Ahmed
    Abu Bakar, Azuraliza
    INTELLIGENT DATA ANALYSIS, 2018, 22 (01) : 45 - 76
  • [36] Feature Selection Techniques Applied to Voice-based Prediction of Parkinson's Disease
    Calibuyot, Dhone Matthews M.
    Ednalan, Emmanuel D.
    Ortega, Nathaniel M.
    Magboo, Ma Sheila A.
    Magboo, Vincent Peter C.
    2024 FIFTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS, ICUFN 2024, 2024, : 263 - 267
  • [37] Surrogate-Model Based Particle Swarm Optimisation with Local Search for Feature Selection in Classification
    Hoai Bach Nguyen
    Xue, Bing
    Andreae, Peter
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2017, PT I, 2017, 10199 : 487 - 505
  • [38] Heuristic Model to Improve Feature Selection Based on Machine Learning in Data Mining
    Majumdar, Jahin
    Mal, Anwesha
    Gupta, Shruti
    2016 6TH INTERNATIONAL CONFERENCE - CLOUD SYSTEM AND BIG DATA ENGINEERING (CONFLUENCE), 2016, : 73 - 77
  • [39] Soil Organic Carbon Prediction Based on Different Combinations of Hyperspectral Feature Selection and Regression Algorithms
    Chang, Naijie
    Jing, Xiaowen
    Zeng, Wenlong
    Zhang, Yungui
    Li, Zhihong
    Chen, Di
    Jiang, Daibing
    Zhong, Xiaoli
    Dong, Guiquan
    Liu, Qingli
    AGRONOMY-BASEL, 2023, 13 (07):
  • [40] Application of gradient boosting regression model for the evaluation of feature selection techniques in improving reservoir characterisation predictions
    Otchere, Daniel Asante
    Ganat, Tarek Omar Arbi
    Ojero, Jude Oghenerurie
    Tackie-Otoo, Bennet Nii
    Taki, Mohamed Yassir
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2022, 208