Bayesian variable selection in multinomial probit model for classifying high-dimensional data

被引:0
|
作者
Aijun Yang
Yunxian Li
Niansheng Tang
Jinguan Lin
机构
[1] Nanjing Forestry University,College of Economics and Management
[2] Southeast University,School of Economics and Management
[3] Yunnan University of Economics and Finance,School of Finance
[4] Yunnan University,Department of Statistics
[5] Southeast University,Department of Mathematics
来源
Computational Statistics | 2015年 / 30卷
关键词
Bayesian stochastic search variable selection; Generalized ; -prior; Multi-class classification;
D O I
暂无
中图分类号
学科分类号
摘要
Selecting a small number of relevant genes for classification has received a great deal of attention in microarray data analysis. While the development of methods for microarray data with only two classes is relevant, developing more efficient algorithms for classification with any number of classes is important. In this paper, we propose a Bayesian stochastic search variable selection approach for multi-class classification, which can identify relevant genes by assessing sets of genes jointly. We consider a multinomial probit model with a generalized g\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$g$$\end{document}-prior for the regression coefficients. An efficient algorithm using simulation-based MCMC methods are developed for simulating parameters from the posterior distribution. This algorithm is robust to the choice of initial value, and produces posterior probabilities of relevant genes for biological interpretation. We demonstrate the performance of the approach with two well-known gene expression profiling data: leukemia data, lymphoma data, SRBCTs data and NCI60 data. Compared with other classification approaches, our approach selects smaller numbers of relevant genes and obtains competitive classification accuracy based on obtained results.
引用
收藏
页码:399 / 418
页数:19
相关论文
共 50 条
  • [1] Bayesian variable selection in multinomial probit model for classifying high-dimensional data
    Yang, Aijun
    Li, Yunxian
    Tang, Niansheng
    Lin, Jinguan
    [J]. COMPUTATIONAL STATISTICS, 2015, 30 (02) : 399 - 418
  • [2] Sparse Bayesian variable selection in multinomial probit regression model with application to high-dimensional data classification
    Yang Aijun
    Jiang Xuejun
    Xiang Liming
    Lin Jinguan
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (12) : 6137 - 6150
  • [3] Sparse Bayesian variable selection for classifying high-dimensional data
    Yang, Aijun
    Lian, Heng
    Jiang, Xuejun
    Liu, Pengfei
    [J]. STATISTICS AND ITS INTERFACE, 2018, 11 (02) : 385 - 395
  • [4] Sparse Bayesian variable selection in kernel probit model for analyzing high-dimensional data
    Aijun Yang
    Yuzhu Tian
    Yunxian Li
    Jinguan Lin
    [J]. Computational Statistics, 2020, 35 : 245 - 258
  • [5] Sparse Bayesian variable selection in kernel probit model for analyzing high-dimensional data
    Yang, Aijun
    Tian, Yuzhu
    Li, Yunxian
    Lin, Jinguan
    [J]. COMPUTATIONAL STATISTICS, 2020, 35 (01) : 245 - 258
  • [6] Sparse bayesian kernel multinomial probit regression model for high-dimensional data classification
    Yang, Aijun
    Jiang, Xuejun
    Shu, Lianjie
    Liu, Pengfei
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2019, 48 (01) : 165 - 176
  • [7] Bayesian variable selection and model averaging in high-dimensional multinomial nonparametric regression
    Yau, P
    Kohn, R
    Wood, S
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2003, 12 (01) : 23 - 54
  • [8] Sparse Bayesian multinomial probit regression model with correlation prior for high-dimensional data classification
    Yang Aijun
    Jiang Xuejun
    Liu Pengfei
    Lin Jinguan
    [J]. STATISTICS & PROBABILITY LETTERS, 2016, 119 : 241 - 247
  • [9] Bayesian variable selection in clustering high-dimensional data
    Tadesse, MG
    Sha, N
    Vannucci, M
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (470) : 602 - 617
  • [10] Bayesian variable selection for high-dimensional rank data
    Cui, Can
    Singh, Susheela P.
    Staicu, Ana-Maria
    Reich, Brian J.
    [J]. ENVIRONMETRICS, 2021, 32 (07)