Methods of selecting informative variables

被引:0
|
作者
Fedorov, VV
Herzberg, AM
Leonov, SL
机构
[1] ClaxoSmithKline, Collegeville, PA 19426 USA
[2] Queens Univ, Dept Math & Stat, Kingston, ON K7L 3N6, Canada
关键词
dimension reduction; optimal experimental design; principal components; principal variables;
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We propose a new method for selection of the most informative variables from the set of variables which can be measured directly. The information is measured by metrics similar to those used in experimental design theory, such as determinant of the dispersion matrix of prediction or various functions of its eigenvalues. The basic model admits both population variability and observational errors, which allows us to introduce algorithms based on ideas of optimal experimental design. Moreover, we can take into account cost of measuring various variables which makes the approach more practical. It is shown that the selection of optimal subsets of variables is invariant to scale transformations unlike other methods of dimension reduction, such as principal components analysis or methods based on direct selection of variables, for instance principal variables and battery reduction. The performance of different approaches is compared using the clinical data.
引用
收藏
页码:157 / 173
页数:17
相关论文
共 50 条
  • [31] Selecting Informative Features for Post-hoc Community Explanation
    Sadler, Sophie
    Greene, Derek
    Archambault, Daniel
    COMPLEX NETWORKS & THEIR APPLICATIONS X, VOL 1, 2022, 1015 : 297 - 308
  • [32] Integer programming for selecting set of informative markers in paternity inference
    Nishiyama, Soichiro
    Sato, Kengo
    Tao, Ryutaro
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [33] Selecting Informative Universum Sample for Semi-Supervised Learning
    Chen, Shuo
    Zhang, Changshui
    21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, 2009, : 1016 - 1021
  • [34] Selecting informative rules with parallel genetic algorithm in classification problem
    Sarkar, Bikash Kanti
    Sana, Shib Sankar
    Chaudhuri, Kripasindhu
    APPLIED MATHEMATICS AND COMPUTATION, 2011, 218 (07) : 3247 - 3264
  • [35] Selecting maximally informative sibships for QTL linkage and association analysis
    Purcell, S
    Cherny, SS
    Sham, PC
    MOLECULAR PSYCHIATRY, 1999, 4 : S11 - S11
  • [36] Selecting Most Informative Contributors with Unknown Costs for Budgeted Crowdsensing
    Yang, Shuo
    Wu, Fan
    Tang, Shaojie
    Luo, Tie
    Gao, Xiaofeng
    Kong, Linghe
    Chen, Guihai
    2016 IEEE/ACM 24TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2016,
  • [37] Distance-Entropy: An Effective Indicator for Selecting Informative Data
    Li, Yang
    Chao, Xuewei
    FRONTIERS IN PLANT SCIENCE, 2022, 12
  • [38] Selecting Informative Genes by Lasso and Dantzig Selector for Linear Classifiers
    Zheng, Songfeng
    Liu, Weixiang
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2010, : 677 - 680
  • [39] Selecting maximally informative sibships for QTL linkage analysis.
    Cherny, SS
    Purcell, S
    Rijsdijk, F
    Hewitt, JK
    Sham, PC
    BEHAVIOR GENETICS, 1999, 29 (05) : 352 - 352
  • [40] Selecting maximally informative sibships for OTL association analysis.
    Purcell, S
    Cherny, SS
    Rijsdijk, F
    Hewitt, JK
    Sham, PC
    BEHAVIOR GENETICS, 1999, 29 (05) : 367 - 367