A surrogate model based on feature selection techniques and regression learners to improve soybean yield prediction in southern France

被引:23
|
作者
Corrales, David Camilo [1 ,2 ]
Schoving, Celine [3 ]
Raynal, Helene [1 ]
Debaeke, Philippe [1 ]
Journet, Etienne-Pascal [1 ,4 ]
Constantin, Julie [1 ]
机构
[1] Univ Toulouse, INRAE, UMR AGIR, F-31326 Castanet Tolosan, France
[2] Univ Cauca, Grp Ingn Telemat, Popayan, Colombia
[3] Terres Inovia, Baziege, France
[4] Univ Toulouse, INRAE, CNRS, LIPME, F-31326 Castanet Tolosan, France
基金
欧盟地平线“2020”;
关键词
STICS; Regression learners; Filter; Wrapper; Embedded; SOIL-CROP MODEL; CLASSIFICATION; STICS; CALIBRATION; ACCURACY; FILTER; WATER;
D O I
10.1016/j.compag.2021.106578
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Empirical and process-based models are currently used to predict crop yield at field and regional levels. A mechanistic model named STICS (Multidisciplinary Simulator for Standard Crops) has been used to simulate soybean grain yield in several environments, including southern France. STICS simulates at a daily step the effects of climate, soil and management practices on plant growth, development and production. In spite of good performances to predict total aboveground biomass, poor results were obtained for final grain yield. In order to improve yield prediction, a surrogate model was developed from STICS dynamic simulations, feature selection techniques and regression learners. STICS was used to simulate functional variables at given growth stages and over selected phenological phases. The most representative variables were selected through feature selection techniques (filter, wrapper and embedded), and a subset of variables were used to train the regression learners Linear regression (LR), Support vector regression (SVR), Back propagation neural network (BPNN), Random forest (RF), Least Absolute Shrinkage and Selection Operator (LASSO) and M5 decision tree. The subset of variables selected by wrapper method combined with regression models SVR (R2 = 0. 7102; subset of variables = 6) and LR (R2 = 0. 6912; subset of variables = 14) provided the best results. SVR and LR models improved significantly the soybean yield predictions in southern France in comparison to STICS simulations (R2 = 0.040).
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Development of an early prediction model for vomiting during hemodialysis using LASSO regression and Boruta feature selection
    Chen, Jiajia
    Shen, Cheng
    Xue, Haiyan
    Yuan, Benyin
    Zheng, Bing
    Shen, Lianglan
    Fang, Xingxing
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [42] Feature selection using particle swarm optimization-based logistic regression model
    Qasim, Omar Saber
    Algamal, Zakariya Yahya
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2018, 182 : 41 - 46
  • [43] A Ship Energy Consumption Prediction Method Based on TGMA Model and Feature Selection
    Liu, Yuhang
    Wang, Kai
    Lu, Yong
    Zhang, Yongfeng
    Li, Zhongwei
    Ma, Ranqi
    Huang, Lianzhong
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (07)
  • [44] Survival risk prediction model for ESCC based on relief feature selection and CNN
    Wang, Yanfeng
    Zhu, Chuanqian
    Wang, Yan
    Sun, Junwei
    Ling, Dan
    Wang, Lidong
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 145
  • [45] A stacking ensemble model based on nonlinear feature selection for photovoltaic power prediction
    Tang, Xin
    Zhang, Haiqing
    Li, Daiwei
    Tang, Dan
    Gong, Cheng
    Yu, Xi
    2024 7TH ASIA CONFERENCE ON ENERGY AND ELECTRICAL ENGINEERING, ACEEE 2024, 2024, : 345 - 349
  • [46] A Gas Emission Prediction Model Based on Feature Selection and Improved Machine Learning
    Shao, Liangshan
    Zhang, Kun
    PROCESSES, 2023, 11 (03)
  • [47] Stock price prediction model based on BP neural network for feature selection
    Lin, Chaoxiong
    2ND INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, MODELLING, AND INTELLIGENT COMPUTING (CAMMIC 2022), 2022, 12259
  • [48] LightGBM Low-Temperature Prediction Model Based on LassoCV Feature Selection
    Duan, Shangqi
    Huang, Shuangde
    Bu, Wei
    Ge, Xingke
    Chen, Haidong
    Liu, Jing
    Luo, Jiqiang
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [49] Prediction Model of Converter Oxygen Consumption Based on Recursive Classification and Feature Selection
    Zhang Liu
    Zheng Zhong
    Zhang Kaitian
    Shen Xinyue
    Wang Yongzhou
    ENERGY TECHNOLOGY 2021: CARBON DIOXIDE MANAGEMENT AND OTHER TECHNOLOGIES, 2021, : 95 - 110
  • [50] Satellite-based soybean yield prediction in Argentina: A comparison between panel regression and deep learning methods
    Wang, Yuhao
    Feng, Kuishuang
    Sun, Laixiang
    Xie, Yiqun
    Song, Xiao-Peng
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 221