A surrogate model based on feature selection techniques and regression learners to improve soybean yield prediction in southern France

被引:23
|
作者
Corrales, David Camilo [1 ,2 ]
Schoving, Celine [3 ]
Raynal, Helene [1 ]
Debaeke, Philippe [1 ]
Journet, Etienne-Pascal [1 ,4 ]
Constantin, Julie [1 ]
机构
[1] Univ Toulouse, INRAE, UMR AGIR, F-31326 Castanet Tolosan, France
[2] Univ Cauca, Grp Ingn Telemat, Popayan, Colombia
[3] Terres Inovia, Baziege, France
[4] Univ Toulouse, INRAE, CNRS, LIPME, F-31326 Castanet Tolosan, France
基金
欧盟地平线“2020”;
关键词
STICS; Regression learners; Filter; Wrapper; Embedded; SOIL-CROP MODEL; CLASSIFICATION; STICS; CALIBRATION; ACCURACY; FILTER; WATER;
D O I
10.1016/j.compag.2021.106578
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Empirical and process-based models are currently used to predict crop yield at field and regional levels. A mechanistic model named STICS (Multidisciplinary Simulator for Standard Crops) has been used to simulate soybean grain yield in several environments, including southern France. STICS simulates at a daily step the effects of climate, soil and management practices on plant growth, development and production. In spite of good performances to predict total aboveground biomass, poor results were obtained for final grain yield. In order to improve yield prediction, a surrogate model was developed from STICS dynamic simulations, feature selection techniques and regression learners. STICS was used to simulate functional variables at given growth stages and over selected phenological phases. The most representative variables were selected through feature selection techniques (filter, wrapper and embedded), and a subset of variables were used to train the regression learners Linear regression (LR), Support vector regression (SVR), Back propagation neural network (BPNN), Random forest (RF), Least Absolute Shrinkage and Selection Operator (LASSO) and M5 decision tree. The subset of variables selected by wrapper method combined with regression models SVR (R2 = 0. 7102; subset of variables = 6) and LR (R2 = 0. 6912; subset of variables = 14) provided the best results. SVR and LR models improved significantly the soybean yield predictions in southern France in comparison to STICS simulations (R2 = 0.040).
引用
收藏
页数:19
相关论文
共 50 条
  • [21] UAV-based coffee yield prediction utilizing feature selection and deep learning
    Barbosa, Brenon Diennevan Souza
    Ferraz, Gabriel Araujo e Silva
    Costa, Lucas
    Ampatzidis, Yiannis
    Vijayakumar, Vinay
    Santos, Luana Mendes dos
    SMART AGRICULTURAL TECHNOLOGY, 2021, 1
  • [22] From Baseline to Best Practice: An Advanced Feature Selection, Feature Resampling and Grid Search Techniques to Improve Injury Severity Prediction
    EL Ferouali, Soukaina
    Abou Elassad, Zouhair Elamrani
    Qassimi, Sara
    Abdali, Abdelmounaim
    APPLIED ARTIFICIAL INTELLIGENCE, 2025, 39 (01)
  • [23] Unsupervised feature selection based on generalized regression model with linear discriminant constraints
    Xiangguang Dai
    Mingyu Guan
    Facheng Dai
    Wei Zhang
    Tingji Zhang
    Hangjun Che
    Xiangqin Dai
    Complex & Intelligent Systems, 2025, 11 (6)
  • [24] Adaptive graph-based generalized regression model for unsupervised feature selection
    Huang, Yanyong
    Shen, Zongxin
    Cai, Fuxu
    Li, Tianrui
    Lv, Fengmao
    KNOWLEDGE-BASED SYSTEMS, 2021, 227
  • [25] On the influence of feature selection in fuzzy rule-based regression model generation
    Antonelli, Michela
    Ducange, Pietro
    Marcelloni, Francesco
    Segatori, Armando
    INFORMATION SCIENCES, 2016, 329 : 649 - 669
  • [26] A surrogate model-based approach for adaptive selection of the optimal traffic conflict prediction model
    Wu, Dan
    Lee, Jaeyoung Jay
    Li, Ye
    Li, Jipu
    Tian, Shan
    Yang, Zhanhao
    ACCIDENT ANALYSIS AND PREVENTION, 2024, 207
  • [27] Recognition of soybean pods and yield prediction based on improved deep learning model
    He, Haotian
    Ma, Xiaodan
    Guan, Haiou
    Wang, Feiyi
    Shen, Panpan
    FRONTIERS IN PLANT SCIENCE, 2023, 13
  • [28] PREDICTION-BASED MODEL SELECTION FOR BAYESIAN MULTIPLE REGRESSION MODELS
    Pintar, Adam L.
    Anderson-Cook, Christine M.
    Wu, Huaiqing
    ADVANCES AND APPLICATIONS IN STATISTICS, 2013, 32 (02) : 83 - 117
  • [29] A many objective based feature selection model for software defect prediction
    Mao, Qi
    Zhang, Jingbo
    Zhao, Tianhao
    Cai, Xingjuan
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (19):
  • [30] A diabetes prediction model based on Boruta feature selection and ensemble learning
    Hongfang Zhou
    Yinbo Xin
    Suli Li
    BMC Bioinformatics, 24