Group additive regression models for genomic data analysis

被引:35
|
作者
Luan, Yihui [1 ,2 ]
Li, Hongzhe [1 ]
机构
[1] Univ Penn, Sch Med, Dept Biostat & Epidemiol, Philadelphia, PA 19104 USA
[2] Shandong Univ, Sch Math & Syst Sci, Shandong 250100, Peoples R China
关键词
AFT models; boosting; gradient descent boosting; pathway;
D O I
10.1093/biostatistics/kxm015
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
One important problem in genomic research is to identify genomic features such as gene expression data or DNA single nucleotide polymorphisms (SNPs) that are related to clinical phenotypes. Often these genomic data can be naturally divided into biologically meaningful groups such as genes belonging to the same pathways or SNPs within genes. In this paper, we propose group additive regression models and a group gradient descent boosting procedure for identifying groups of genomic features that are related to clinical phenotypes. Our simulation results show that by dividing the variables into appropriate groups, we can obtain better identification of the group features that are related to the phenotypes. In addition, the prediction mean square errors are also smaller than the component-wise boosting procedure. We demonstrate the application of the methods to pathway-based analysis of microarray gene expression data of breast cancer. Results from analysis of a breast cancer microarray gene expression data set indicate that the pathways of metalloendopeptidases (MMPs) and MMP inhibitors, as well as cell proliferation, cell growth, and maintenance are important to breast cancer-specific survival.
引用
收藏
页码:100 / 113
页数:14
相关论文
共 50 条
  • [31] Estimation of fixed effects panel data partially linear additive regression models
    Ai, Chunrong
    You, Jinhong
    Zhou, Yong
    [J]. ECONOMETRICS JOURNAL, 2014, 17 (01): : 83 - 106
  • [32] A class of additive-accelerated means regression models for recurrent event data
    Liu Li
    Mu XiaoYun
    Sun LiuQuan
    [J]. SCIENCE CHINA-MATHEMATICS, 2010, 53 (12) : 3139 - 3151
  • [33] A class of additive-accelerated means regression models for recurrent event data
    Li Liu
    XiaoYun Mu
    LiuQuan Sun
    [J]. Science China Mathematics, 2010, 53 : 3139 - 3151
  • [34] On additive perturbations in nonlinear regression models
    Potocky, R
    Ban, TV
    [J]. TATRA MOUNTAINS MATHEMATICAL PUBLICATIONS, VOL 17, 1998, : 65 - 70
  • [35] Optimal estimation in additive regression models
    Horowitz, J
    Klemelä, J
    Mammen, E
    [J]. BERNOULLI, 2006, 12 (02) : 271 - 298
  • [36] ADDITIVE REGRESSION AND OTHER NONPARAMETRIC MODELS
    STONE, CJ
    [J]. ANNALS OF STATISTICS, 1985, 13 (02): : 689 - 705
  • [37] Additive isotonic regression models in epidemiology
    Morton-Jones, T
    Diggle, P
    Parker, L
    Dickinson, HO
    Binks, K
    [J]. STATISTICS IN MEDICINE, 2000, 19 (06) : 849 - 859
  • [38] Semiparametric mixture of additive regression models
    Zhang, Yi
    Zheng, Qingle
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2018, 47 (03) : 681 - 697
  • [39] Spike-and-Slab Group Lassos for Grouped Regression and Sparse Generalized Additive Models
    Bai, Ray
    Moran, Gemma E.
    Antonelli, Joseph L.
    Chen, Yong
    Boland, Mary R.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (537) : 184 - 197
  • [40] Semiparametric analysis of additive isotonic errors-in-variables regression models
    Sun, Zhimeng
    Zhang, Zhongzhan
    [J]. STATISTICS & PROBABILITY LETTERS, 2013, 83 (01) : 100 - 114