Multivariate Boosting for Integrative Analysis of High-Dimensional Cancer Genomic Data

被引:1
|
作者
Xiong, Lie [1 ]
Kuan, Pei-Fen [2 ]
Tian, Jianan [1 ]
Keles, Sunduz [1 ,3 ]
Wang, Sijian [1 ,3 ]
机构
[1] Univ Wisconsin, Dept Stat, Madison, WI 53706 USA
[2] SUNY Stony Brook, Dept Appl Math & Stat, Stony Brook, NY 11794 USA
[3] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI 53706 USA
来源
CANCER INFORMATICS | 2014年 / 13卷
关键词
boosting; breast cancer; integrative genomic analysis; multivariate regression;
D O I
10.4137/CIN.S16353
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
In this paper, we propose a novel multivariate component-wise boosting method for fitting multivariate response regression models under the high-dimension, low sample size setting. Our method is motivated by modeling the association among different biological molecules based on multiple types of high-dimensional genomic data. Particularly, we are interested in two applications: studying the influence of DNA copy number alterations on RNA transcript levels and investigating the association between DNA methylation and gene expression. For this purpose, we model the dependence of the RNA expression levels on DNA copy number alterations and the dependence of gene expression on DNA methylation through multivariate regression models and utilize boosting-type method to handle the high dimensionality as well as model the possible nonlinear associations. The performance of the proposed method is demonstrated through simulation studies. Finally, our multivariate boosting method is applied to two breast cancer studies.
引用
收藏
页码:123 / 131
页数:9
相关论文
共 50 条
  • [1] Integrative Sparse Bayesian Analysis of High-dimensional Multi-platform Genomic Data in Glioblastoma
    Bhadra, Anindya
    Baladandayuthapani, Veerabhadran
    [J]. 2013 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS 2013), 2013, : 1 - 4
  • [2] High-dimensional integrative copula discriminant analysis for multiomics data
    He, Yong
    Chen, Hao
    Sun, Hao
    Ji, Jiadong
    Shi, Yufeng
    Zhang, Xinsheng
    Liu, Lei
    [J]. STATISTICS IN MEDICINE, 2020, 39 (30) : 4869 - 4884
  • [3] Boosting for high-multivariate responses in high-dimensional linear regression
    Lutz, Roman Werner
    Buehlmann, Peter
    [J]. STATISTICA SINICA, 2006, 16 (02) : 471 - 494
  • [4] Sparse redundancy analysis of high-dimensional genetic and genomic data
    Csala, Attila
    Voorbraak, Frans P. J. M.
    Zwinderman, Aeilko H.
    Hof, Michel H.
    [J]. BIOINFORMATICS, 2017, 33 (20) : 3228 - 3234
  • [5] Generalized Linear Discriminant Analysis for High-Dimensional Genomic Data
    Li, Sisi
    Lewinger, Juan Pablo
    [J]. GENETIC EPIDEMIOLOGY, 2017, 41 (07) : 704 - 704
  • [6] Generalized linear discriminant analysis for high-dimensional genomic data
    Li, Sisi
    Lewinger, Juan Pablo
    [J]. GENETIC EPIDEMIOLOGY, 2018, 42 (07) : 713 - 713
  • [7] Individual Data Protected Integrative Regression Analysis of High-Dimensional Heterogeneous Data
    Cai, Tianxi
    Liu, Molei
    Xia, Yin
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (540) : 2105 - 2119
  • [8] High-dimensional multivariate probit analysis
    Bock, RD
    Gibbons, RD
    [J]. BIOMETRICS, 1996, 52 (04) : 1183 - 1194
  • [9] iBAG: integrative Bayesian analysis of high-dimensional multiplatform genomics data
    Wang, Wenting
    Baladandayuthapani, Veerabhadran
    Morris, Jeffrey S.
    Broom, Bradley M.
    Manyam, Ganiraju
    Do, Kim-Anh
    [J]. BIOINFORMATICS, 2013, 29 (02) : 149 - 159
  • [10] VISUALIZING STRUCTURE IN HIGH-DIMENSIONAL MULTIVARIATE DATA
    YOUNG, FW
    RHEINGANS, P
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1991, 35 (1-2) : 97 - 107