LASSO with cross-validation for genomic selection

被引:118
|
作者
Usai, M. Graziano [1 ]
Goddard, Mike E. [2 ,3 ]
Hayes, Ben J. [3 ]
机构
[1] AGRIS Sardegna, Settore Genet & Biotecnol, I-07040 Olmedo, SS, Italy
[2] Univ Melbourne, Fac Land & Food Resources, Parkville, Vic 3010, Australia
[3] Dept Primary Ind Victoria, Biosci Res Div, Bundoora, Vic 3083, Australia
关键词
QUANTITATIVE TRAITS; GENETIC VALUE; PREDICTION;
D O I
10.1017/S0016672309990334
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We used a least absolute shrinkage and selection operator (LASSO) approach to estimate marker effects for genomic selection. The least angle regression (LARS) algorithm and cross-validation were used to define the best subset of markers to include in the model. The LASSO-LARS approach was tested oil two data sets: a simulated data set with 5865 individuals and 6000 Single Nucleotide Polymorphisms (SNPs); and a Mouse data set with 1885 individuals genotyped for 10656 SNPs and phenotyped for a number of quantitative traits. In the simulated data, three approaches were Used to split the reference population into training and validation subsets for cross-validation: random Splitting across the Whole population random sampling of validation set from the last generation only, either within or across Families. The highest accuracy was obtained by random splitting across the whole population. The accuracy of genomic estimated breeding Values (GEBVs) in the candidate Population obtained by LASSO-LARS was 0-89 with 156 explanatory SNPs. This Value Was higher than those obtained by Best Linear Unbiased Prediction (BLUP) and a Bayesian method (BayesA), which were 0-75 and 0-84, respectively. In the Mouse data, 1600 individuals Were randomly allocated to the reference population. The GEBVs for the remaining 285 individuals estimated by LASSO-LARS were more accurate than those obtained by BLUP and BayesA for weight at six weeks and slightly lower for growth rate and body length. It Was Concluded that LASSO-LARS approach is a good alternative method to estimate marker effects for genomic Selection, particularly when the cost of genotyping can be reduced by using a limited Subset of markers.
引用
收藏
页码:427 / 436
页数:10
相关论文
共 50 条
  • [1] On the use of cross-validation for the calibration of the adaptive lasso
    Ballout, Nadim
    Etievant, Lola
    Viallon, Vivian
    [J]. BIOMETRICAL JOURNAL, 2023, 65 (05)
  • [2] Stabilizing the lasso against cross-validation variability
    Roberts, S.
    Nowak, G.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 70 : 198 - 211
  • [3] The Accuracy of Genomic Selection in Norwegian Red Cattle Assessed by Cross-Validation
    Luan, Tu
    Woolliams, John A.
    Lien, Sigbjorn
    Kent, Matthew
    Svendsen, Morten
    Meuwissen, Theo H. E.
    [J]. GENETICS, 2009, 183 (03) : 1119 - 1126
  • [4] A Note on Cross-Validation for Lasso Under Measurement Errors
    Datta, Abhirup
    Zou, Hui
    [J]. TECHNOMETRICS, 2020, 62 (04) : 549 - 556
  • [5] RISK CONSISTENCY OF CROSS-VALIDATION WITH LASSO-TYPE PROCEDURES
    Homrighausen, Darren
    McDonald, Daniel J.
    [J]. STATISTICA SINICA, 2017, 27 (03) : 1017 - 1036
  • [6] Granularity selection for cross-validation of SVM
    Liu, Yong
    Liao, Shizhong
    [J]. INFORMATION SCIENCES, 2017, 378 : 475 - 483
  • [7] Linear model selection by cross-validation
    Rao, CR
    Wu, Y
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2005, 128 (01) : 231 - 240
  • [8] Leave-one-out cross-validation is risk consistent for lasso
    Darren Homrighausen
    Daniel J. McDonald
    [J]. Machine Learning, 2014, 97 : 65 - 78
  • [9] Leave-one-out cross-validation is risk consistent for lasso
    Homrighausen, Darren
    McDonald, Daniel J.
    [J]. MACHINE LEARNING, 2014, 97 (1-2) : 65 - 78
  • [10] Robust linear model selection by cross-validation
    Ronchetti, E
    Field, C
    Blanchard, W
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1997, 92 (439) : 1017 - 1023