A surrogate model based on feature selection techniques and regression learners to improve soybean yield prediction in southern France

被引:24
|
作者
Corrales, David Camilo [1 ,2 ]
Schoving, Celine [3 ]
Raynal, Helene [1 ]
Debaeke, Philippe [1 ]
Journet, Etienne-Pascal [1 ,4 ]
Constantin, Julie [1 ]
机构
[1] Univ Toulouse, INRAE, UMR AGIR, F-31326 Castanet Tolosan, France
[2] Univ Cauca, Grp Ingn Telemat, Popayan, Colombia
[3] Terres Inovia, Baziege, France
[4] Univ Toulouse, INRAE, CNRS, LIPME, F-31326 Castanet Tolosan, France
基金
欧盟地平线“2020”;
关键词
STICS; Regression learners; Filter; Wrapper; Embedded; SOIL-CROP MODEL; CLASSIFICATION; STICS; CALIBRATION; ACCURACY; FILTER; WATER;
D O I
10.1016/j.compag.2021.106578
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Empirical and process-based models are currently used to predict crop yield at field and regional levels. A mechanistic model named STICS (Multidisciplinary Simulator for Standard Crops) has been used to simulate soybean grain yield in several environments, including southern France. STICS simulates at a daily step the effects of climate, soil and management practices on plant growth, development and production. In spite of good performances to predict total aboveground biomass, poor results were obtained for final grain yield. In order to improve yield prediction, a surrogate model was developed from STICS dynamic simulations, feature selection techniques and regression learners. STICS was used to simulate functional variables at given growth stages and over selected phenological phases. The most representative variables were selected through feature selection techniques (filter, wrapper and embedded), and a subset of variables were used to train the regression learners Linear regression (LR), Support vector regression (SVR), Back propagation neural network (BPNN), Random forest (RF), Least Absolute Shrinkage and Selection Operator (LASSO) and M5 decision tree. The subset of variables selected by wrapper method combined with regression models SVR (R2 = 0. 7102; subset of variables = 6) and LR (R2 = 0. 6912; subset of variables = 14) provided the best results. SVR and LR models improved significantly the soybean yield predictions in southern France in comparison to STICS simulations (R2 = 0.040).
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Feature Selection for Surrogate Model-Based Optimization
    Rehbach, Frederik
    Gentile, Lorenzo
    Bartz-Beielstein, Thomas
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, : 399 - 400
  • [2] Mutual Information Feature Selection (MIFS) Based Crop Yield Prediction on Corn and Soybean Crops Using Multilayer Stacked Ensemble Regression (MSER)
    S. Iniyan
    R. Jebakumar
    Wireless Personal Communications, 2022, 126 : 1935 - 1964
  • [3] Mutual Information Feature Selection (MIFS) Based Crop Yield Prediction on Corn and Soybean Crops Using Multilayer Stacked Ensemble Regression (MSER)
    Iniyan, S.
    Jebakumar, R.
    WIRELESS PERSONAL COMMUNICATIONS, 2022, 126 (03) : 1935 - 1964
  • [4] Assessing Feature Selection Techniques for a Colorectal Cancer Prediction Model
    Cueto-Lopez, Nahum
    Alaiz-Rodriguez, Rocio
    Teresa Garcia-Ordas, Maria
    Gonzalez-Donquiles, Carmen
    Martin, Vicente
    INTERNATIONAL JOINT CONFERENCE SOCO'17- CISIS'17-ICEUTE'17 PROCEEDINGS, 2018, 649 : 471 - 481
  • [5] Feature selection in dispatching rules based on surrogate model genetic programming
    Zeng L.
    Li Y.
    Wang S.
    Quan R.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2023, 51 (02): : 139 - 145
  • [6] Accelerating band gap prediction for solar materials using feature selection and regression techniques
    Khmaissia, Fadoua
    Frigui, Hichem
    Sunkara, Mahendra
    Jasinski, Jacek
    Garcia, Alejandro Martinez
    Pace, Tom
    Menon, Madhu
    COMPUTATIONAL MATERIALS SCIENCE, 2018, 147 : 304 - 315
  • [7] Feature Selection Based Machine Learning to Improve Prediction of Parkinson Disease
    Nahar, Nazmun
    Ara, Ferdous
    Neloy, Md Arif Istiek
    Biswas, Anik
    Hossain, Mohammad Shahadat
    Andersson, Karl
    BRAIN INFORMATICS, BI 2021, 2021, 12960 : 496 - 508
  • [8] Building Energy Use Surrogate Model Feature Selection - A Methodology Using Forward Stepwise Selection and LASSO Regression Methods
    Barnes, Erica C.
    McArthur, J. J.
    PROCEEDINGS OF BUILDING SIMULATION 2019: 16TH CONFERENCE OF IBPSA, 2020, : 3078 - 3085
  • [9] Obsolescence Prediction based on Joint Feature Selection and Machine Learning Techniques
    Trabelsi, Imen
    Zeddini, Besma
    Zolghadri, Marc
    Barkallah, Maher
    Haddar, Mohamed
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 787 - 794
  • [10] Predictive Model to Analyze Real and Synthetic Data for Learners' Performance Prediction Using Regression Techniques
    Shabnam, Aras S. J.
    Ramachandriah, Tanuja
    Haladappa, Manjula S.
    ONLINE LEARNING, 2025, 29 (01):