Methods of selecting informative variables

Cited by: 0
Authors
Fedorov, VV
Herzberg, AM
Leonov, SL
Affiliations
[1] GlaxoSmithKline, Collegeville, PA 19426 USA
[2] Queens Univ, Dept Math & Stat, Kingston, ON K7L 3N6, Canada
Keywords
dimension reduction; optimal experimental design; principal components; principal variables
DOI
Not available
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
We propose a new method for selecting the most informative variables from a set of variables that can be measured directly. Information is measured by criteria similar to those used in experimental design theory, such as the determinant of the dispersion matrix of prediction or various functions of its eigenvalues. The basic model admits both population variability and observational errors, which allows us to introduce algorithms based on ideas from optimal experimental design. Moreover, the cost of measuring each variable can be taken into account, which makes the approach more practical. We show that the selection of optimal subsets of variables is invariant to scale transformations, unlike other dimension reduction methods such as principal components analysis, or methods based on direct selection of variables, such as principal variables and battery reduction. The performance of the different approaches is compared using clinical data.
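As a rough illustration of the determinant criterion mentioned in the abstract, the sketch below scores each candidate subset by the log-determinant of the dispersion matrix of the best linear predictor of all variables and keeps the subset with the smallest value. Everything specific here is an assumption made for illustration and is not taken from the paper: the model form (a known covariance matrix `K` for population variability, independent measurement errors with variances `noise_var`), the exhaustive search, the example numbers, and the function names `prediction_dispersion`, `log_det_criterion`, and `best_subset`.

```python
import itertools
import numpy as np


def prediction_dispersion(K, noise_var, subset):
    """Dispersion matrix of the best linear predictor of all variables,
    given noisy measurements of the variables in `subset` (assumed model)."""
    idx = list(subset)
    K_cols = K[:, idx]                                  # cov(all variables, measured ones)
    G = K[np.ix_(idx, idx)] + np.diag(noise_var[idx])   # covariance of the noisy measurements
    return K - K_cols @ np.linalg.solve(G, K_cols.T)


def log_det_criterion(K, noise_var, subset):
    """Log-determinant of the prediction dispersion matrix (smaller is better)."""
    _sign, logdet = np.linalg.slogdet(prediction_dispersion(K, noise_var, subset))
    return logdet


def best_subset(K, noise_var, size):
    """Exhaustive search over all subsets of the given size (illustrative only)."""
    candidates = itertools.combinations(range(K.shape[0]), size)
    return min(candidates, key=lambda s: log_det_criterion(K, noise_var, s))


# Hypothetical 3-variable example; the numbers are made up.
K = np.array([[4.0, 1.2, 0.5],
              [1.2, 1.0, 0.3],
              [0.5, 0.3, 2.0]])
noise_var = np.array([0.2, 0.2, 0.2])
print(best_subset(K, noise_var, size=1))
```

Under this assumed model, rescaling variable i by a factor a_i (with its measurement error rescaled accordingly) multiplies the dispersion matrix on both sides by diag(a). The log-determinant then shifts by the same constant for every subset, so the selected subset should not change, which is consistent with the scale invariance claimed in the abstract.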
Pages: 157-173
Page count: 17
Related papers
50 in total
  • [41] A COMPARISON OF THREE METHODS FOR SELECTING VALUES OF INPUT VARIABLES IN THE ANALYSIS OF OUTPUT FROM A COMPUTER CODE
    MCKAY, MD
    BECKMAN, RJ
    CONOVER, WJ
    TECHNOMETRICS, 1979, 21 (02) : 239 - 245
  • [42] A comparison of three methods for selecting values of input variables in the analysis of output from a computer code
    Mckay, MD
    Beckman, RJ
    Conover, WJ
    TECHNOMETRICS, 2000, 42 (01) : 55 - 61
  • [43] Selecting variables for neural network committees
    Bacauskiene, Marija
    Cibulskis, Vladas
    Verikas, Antanas
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 1, 2006, 3971 : 837 - 842
  • [44] Selecting input variables for fuzzy models
    Chin, SL
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 1996, 4 (04) : 243 - 256
  • [45] SELECTING THE BEST INSTRUMENTAL VARIABLES ESTIMATOR
    MAGDALINOS, MA
    REVIEW OF ECONOMIC STUDIES, 1985, 52 (03) : 473 - 485
  • [46] SIMPLIFICATION OF THE SAPS BY SELECTING INDEPENDENT VARIABLES
    VIVIAND, X
    GOUVERNET, J
    GRANTHIL, C
    FRANCOIS, G
    INTENSIVE CARE MEDICINE, 1991, 17 (03) : 164 - 168
  • [47] Prediction of urinary tract infection using machine learning methods: a study for finding the most-informative variables
    Farashi, Sajjad
    Momtaz, Hossein Emad
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2025, 25 (01)
  • [49] Informative and Uninformative Variables in the Scientific Registry of Transplant Recipients
    Hsich, E. M.
    Thuita, L.
    McNamara, D.
    Rogers, J. G.
    Schold, J.
    Blackstone, E. H.
    Ishwaran, H.
    JOURNAL OF HEART AND LUNG TRANSPLANTATION, 2017, 36 (04) : S387 - S387
  • [50] Selecting informative conformal prediction sets with false coverage rate control
    Gazin, Ulysse
    Heller, Ruth
    Marandon, Ariane
    Roquain, Etienne
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2025,