Application of high-dimensional feature selection: evaluation for genomic prediction in man

被引:0
|
作者
M. L. Bermingham
R. Pong-Wong
A. Spiliopoulou
C. Hayward
I. Rudan
H. Campbell
A. F. Wright
J. F. Wilson
F. Agakov
P. Navarro
C. S. Haley
机构
[1] MRC Human Genetics Unit,
[2] MRC Institute of Genetics and Molecular Medicine,undefined
[3] University of Edinburgh,undefined
[4] The Roslin Institute and Royal (Dick) School of Veterinary Studies,undefined
[5] University of Edinburgh,undefined
[6] Centre for Population Health Sciences,undefined
[7] University of Edinburgh,undefined
[8] Pharmatics Limited,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
In this study, we investigated the effect of five feature selection approaches on the performance of a mixed model (G-BLUP) and a Bayesian (Bayes C) prediction method. We predicted height, high density lipoprotein cholesterol (HDL) and body mass index (BMI) within 2,186 Croatian and into 810 UK individuals using genome-wide SNP data. Using all SNP information Bayes C and G-BLUP had similar predictive performance across all traits within the Croatian data and for the highly polygenic traits height and BMI when predicting into the UK data. Bayes C outperformed G-BLUP in the prediction of HDL, which is influenced by loci of moderate size, in the UK data. Supervised feature selection of a SNP subset in the G-BLUP framework provided a flexible, generalisable and computationally efficient alternative to Bayes C; but careful evaluation of predictive performance is required when supervised feature selection has been used.
引用
收藏
相关论文
共 50 条
  • [1] Application of high-dimensional feature selection: evaluation for genomic prediction in man
    Bermingham, M. L.
    Pong-Wong, R.
    Spiliopoulou, A.
    Hayward, C.
    Rudan, I.
    Campbell, H.
    Wright, A. F.
    Wilson, J. F.
    Agakov, F.
    Navarro, P.
    Haley, C. S.
    [J]. SCIENTIFIC REPORTS, 2015, 5
  • [2] High-dimensional feature selection for genomic datasets
    Afshar, Majid
    Usefi, Hamid
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 206
  • [3] Unbiased Prediction and Feature Selection in High-Dimensional Survival Regression
    Laimighofer, Michael
    Krumsiek, Jan
    Buettner, Florian
    Theis, Fabian J.
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2016, 23 (04) : 279 - 290
  • [4] Projective inference in high-dimensional problems: Prediction and feature selection
    Piironen, Juho
    Paasiniemi, Markus
    Vehtari, Aki
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2020, 14 (01): : 2155 - 2197
  • [5] High-dimensional genomic feature selection with the ordered stereotype logit model
    Seffernick, Anna Eames
    Mrozek, Krzysztof
    Nicolet, Deedra
    Stone, Richard M.
    Eisfeld, Ann-Kathrin
    Byrd, John C.
    Archer, Kellie J.
    [J]. BRIEFINGS IN BIOINFORMATICS, 2022, 23 (06)
  • [6] Prediction in high-dimensional linear models and application to genomic selection under imperfect linkage disequilibrium
    Rabier, Charles-Elie
    Grusea, Simona
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2021, 70 (04) : 1001 - 1026
  • [7] Feature selection for high-dimensional data
    Bolón-Canedo V.
    Sánchez-Maroño N.
    Alonso-Betanzos A.
    [J]. Progress in Artificial Intelligence, 2016, 5 (2) : 65 - 75
  • [8] Feature selection for high-dimensional data
    Destrero A.
    Mosci S.
    De Mol C.
    Verri A.
    Odone F.
    [J]. Computational Management Science, 2009, 6 (1) : 25 - 40
  • [9] Application of High-Dimensional Feature Selection in Near-Infrared Spectroscopy of Cigarettes' Qualitative Evaluation
    Qin Yuhua
    Ding Xiangqian
    Gong Huili
    [J]. SPECTROSCOPY LETTERS, 2013, 46 (06) : 397 - 402
  • [10] On the accuracy in high-dimensional linear models and its application to genomic selection
    Rabier, Charles-Elie
    Mangin, Brigitte
    Grusea, Simona
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2019, 46 (01) : 289 - 313