Methods of selecting informative variables

Cited by: 0
Authors
Fedorov, VV
Herzberg, AM
Leonov, SL
Affiliations
[1] GlaxoSmithKline, Collegeville, PA 19426 USA
[2] Queens Univ, Dept Math & Stat, Kingston, ON K7L 3N6, Canada
Keywords
dimension reduction; optimal experimental design; principal components; principal variables
DOI
Not available
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
We propose a new method for selecting the most informative variables from the set of variables that can be measured directly. Information is measured by metrics similar to those used in experimental design theory, such as the determinant of the dispersion matrix of prediction or various functions of its eigenvalues. The basic model admits both population variability and observational errors, which allows us to introduce algorithms based on ideas from optimal experimental design. Moreover, we can take into account the cost of measuring the various variables, which makes the approach more practical. It is shown that the selection of optimal subsets of variables is invariant to scale transformations, unlike other methods of dimension reduction, such as principal components analysis, or methods based on direct selection of variables, for instance principal variables and battery reduction. The performance of the different approaches is compared using clinical data.
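The determinant criterion and the scale-invariance claim in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes a simple Gaussian-type model in which each measured variable equals its population value (covariance `K`) plus independent observational error (variances `noise_var`), and it searches all subsets of a fixed size exhaustively. The names `prediction_dispersion`, `best_subset`, `K`, `noise_var`, and `scales` are hypothetical, and measurement costs are left out.

```python
import itertools
import numpy as np


def prediction_dispersion(K, noise_var, subset):
    """Dispersion matrix of the error when all variables are predicted
    from noisy measurements of the variables in `subset`.

    Assumed model (not taken from the paper): y_S = u_S + e, with
    Cov(u) = K and Cov(e) = diag(noise_var[S]).
    """
    K = np.asarray(K, dtype=float)
    noise_var = np.asarray(noise_var, dtype=float)
    S = list(subset)
    K_SS = K[np.ix_(S, S)] + np.diag(noise_var[S])  # covariance of the measurements
    K_aS = K[:, S]                                  # covariance of all variables with them
    return K - K_aS @ np.linalg.solve(K_SS, K_aS.T)


def best_subset(K, noise_var, k):
    """Exhaustive search for the size-k subset minimizing log det of the
    prediction dispersion matrix (a D-type criterion)."""
    best, best_logdet = None, np.inf
    for subset in itertools.combinations(range(len(noise_var)), k):
        D = prediction_dispersion(K, noise_var, subset)
        _, logdet = np.linalg.slogdet(D)
        if logdet < best_logdet:
            best, best_logdet = subset, logdet
    return best, best_logdet


# Illustrative data only: a random positive definite population covariance.
rng = np.random.default_rng(0)
B = rng.normal(size=(5, 5))
K = B @ B.T + 0.5 * np.eye(5)
noise_var = rng.uniform(0.1, 1.0, size=5)

# Rescale each variable (e.g. a change of units): K -> A K A, noise -> A^2 noise.
scales = np.array([1.0, 10.0, 0.1, 100.0, 0.5])
K_scaled = K * np.outer(scales, scales)
noise_scaled = noise_var * scales**2

original_choice, _ = best_subset(K, noise_var, 2)
rescaled_choice, _ = best_subset(K_scaled, noise_scaled, 2)
print(original_choice, rescaled_choice)
```

Under this assumed model, rescaling the variables by a diagonal matrix A turns every candidate dispersion matrix D into A D A, so each log-determinant shifts by the same constant and the ranking of subsets does not change. Principal components computed from `K` and from `K_scaled` would generally differ.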
Pages: 157-173
Number of pages: 17