High-dimensional genomic feature selection with the ordered stereotype logit model

被引:3
|
作者
Seffernick, Anna Eames [1 ]
Mrozek, Krzysztof [2 ]
Nicolet, Deedra [2 ]
Stone, Richard M. [3 ]
Eisfeld, Ann-Kathrin [4 ,5 ]
Byrd, John C. [6 ]
Archer, Kellie J. [7 ]
机构
[1] Ohio State Univ, Columbus, OH 43210 USA
[2] Ohio State Univ, Clara D Bloonfield Ctr, Leukemia Outcomes Res, Comprehens Canc Ctr, Columbus, OH 43210 USA
[3] Dana Farber Canc Inst, Adult Acute Leukemia Program, Boston, MA 02115 USA
[4] Ohio State Comprehens Canc Ctr, Div Hematol, Columbus, OH USA
[5] Clara D Bloomfield Ctr Leukemia Outcomes Res, Bloomfield, NJ USA
[6] Univ Cincinnati, Coll Med, Dept Intnrnal Med, Cincinnati, OH 45221 USA
[7] Ohio State Univ, Div Biostat, Columbus, OH 43210 USA
基金
美国国家卫生研究院;
关键词
hierarchical model; ordinal response; variable selection; acute myeloid leukemia; ACUTE MYELOID-LEUKEMIA; VARIABLE SELECTION; BAYESIAN LASSO; REGRESSION; RECOMMENDATIONS; NORMALIZATION; ASSOCIATION; MANAGEMENT; DIAGNOSIS; SHRINKAGE;
D O I
10.1093/bib/bbac414
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
For many high-dimensional genomic and epigenomic datasets, the outcome of interest is ordinal. While these ordinal outcomes are often thought of as the observed cutpoints of some latent continuous variable, some ordinal outcomes are truly discrete and are comprised of the subjective combination of several factors. The nonlinear stereotype logistic model, which does not assume proportional odds, was developed for these 'assessed' ordinal variables. It has previously been extended to the frequentist high-dimensional feature selection setting, but the Bayesian framework provides some distinct advantages in terms of simultaneous uncertainty quantification and variable selection. Here, we review the stereotype model and Bayesian variable selection methods and demonstrate how to combine them to select genomic features associated with discrete ordinal outcomes. We compared the Bayesian and frequentist methods in terms of variable selection performance. We additionally applied the Bayesian stereotype method to an acute myeloid leukemia RNA-sequencing dataset to further demonstrate its variable selection abilities by identifying features associated with the European LeukemiaNet prognostic risk score.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Evaluating Feature Selection Robustness on High-Dimensional Data
    Pes, Barbara
    [J]. HYBRID ARTIFICIAL INTELLIGENT SYSTEMS (HAIS 2018), 2018, 10870 : 235 - 247
  • [42] Feature selection for classifying high-dimensional numerical data
    Wu, YM
    Zhang, AD
    [J]. PROCEEDINGS OF THE 2004 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, 2004, : 251 - 258
  • [43] Extremely High-Dimensional Feature Selection via Feature Generating Samplings
    Li, Shutao
    Wei, Dan
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (06) : 737 - 747
  • [44] High-Dimensional Feature Selection by Feature-Wise Kernelized Lasso
    Yamada, Makoto
    Jitkrittum, Wittawat
    Sigal, Leonid
    Xing, Eric P.
    Sugiyama, Masashi
    [J]. NEURAL COMPUTATION, 2014, 26 (01) : 185 - 207
  • [45] A new improved filter-based feature selection model for high-dimensional data
    Munirathinam, Deepak Raj
    Ranganadhan, Mohanasundaram
    [J]. JOURNAL OF SUPERCOMPUTING, 2020, 76 (08): : 5745 - 5762
  • [46] Comparison of biomarker selection methods in high-dimensional genomic data
    Wang, Y.
    Guo, S.
    [J]. EUROPEAN JOURNAL OF CANCER, 2022, 174 : S98 - S98
  • [47] A new improved filter-based feature selection model for high-dimensional data
    Deepak Raj Munirathinam
    Mohanasundaram Ranganadhan
    [J]. The Journal of Supercomputing, 2020, 76 : 5745 - 5762
  • [48] A Light Causal Feature Selection Approach to High-Dimensional Data
    Ling, Zhaolong
    Li, Ying
    Zhang, Yiwen
    Yu, Kui
    Zhou, Peng
    Li, Bo
    Wu, Xindong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) : 7639 - 7650
  • [49] Single Sequence Fast Feature Selection for High-Dimensional Data
    Boldt, Francisco de Assis
    Rauber, Thomas W.
    Varejao, Flavio M.
    [J]. 2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015, : 697 - 704
  • [50] High-dimensional sign-constrained feature selection and grouping
    Qin, Shanshan
    Ding, Hao
    Wu, Yuehua
    Liu, Feng
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2021, 73 (04) : 787 - 819