Integrative analysis of multiple cancer genomic datasets under the heterogeneity model

被引:18
|
作者
Liu, Jin [1 ]
Huang, Jian [2 ]
Ma, Shuangge [1 ,3 ]
机构
[1] Yale Univ, Sch Publ Hlth, Dept Biostat, New Haven, CT 06520 USA
[2] Univ Iowa, Dept Biostat, Dept Stat & Actuarial Sci, Iowa City, IA 52242 USA
[3] VA Cooperat Studies Program Coordinating Ctr, West Haven, CT USA
基金
美国国家卫生研究院;
关键词
integrative analysis; heterogeneity model; marker selection; EXPRESSION; IDENTIFICATION; SELECTION; PROFILES;
D O I
10.1002/sim.5780
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In the analysis of cancer studies with high-dimensional genomic measurements, integrative analysis provides an effective way of pooling information across multiple heterogeneous datasets. The genomic basis of multiple independent datasets, which can be characterized by the sets of genomic markers, can be described using the homogeneity model or heterogeneity model. Under the homogeneity model, all datasets share the same set of markers associated with responses. In contrast, under the heterogeneity model, different studies have overlapping but possibly different sets of markers. The heterogeneity model contains the homogeneity model as a special case and can be much more flexible. Marker selection under the heterogeneity model calls for bi-level selection to determine whether a covariate is associated with response in any study at all as well as in which studies it is associated with responses. In this study, we consider two minimax concave penalty-based penalization approaches for marker selection under the heterogeneity model. For each approach, we describe its rationale and an effective computational algorithm. We conduct simulations to investigate their performance and compare with the existing alternatives. We also apply the proposed approaches to the analysis of gene expression data on multiple cancers. Copyright (c) 2013 John Wiley & Sons, Ltd.
引用
收藏
页码:3509 / 3521
页数:13
相关论文
共 50 条
  • [1] Integrative Analysis of Multiple Cancer Prognosis Datasets Under the Heterogeneity Model
    Liu, Jin
    Huang, Jian
    Ma, Shuangge
    [J]. TOPICS IN APPLIED STATISTICS, 2013, 55 : 257 - 269
  • [2] Integrative analysis of multiple cancer genomic datasets under the heterogeneity model (vol 32, pg 3509, 2013)
    Ma, Shuangge
    [J]. STATISTICS IN MEDICINE, 2013, 32 (20) : 3594 - 3594
  • [3] Integrative prescreening in analysis of multiple cancer genomic studies
    Rui Song
    Jian Huang
    Shuangge Ma
    [J]. BMC Bioinformatics, 13
  • [4] Integrative prescreening in analysis of multiple cancer genomic studies
    Song, Rui
    Huang, Jian
    Ma, Shuangge
    [J]. BMC BIOINFORMATICS, 2012, 13
  • [5] Sparse group penalized integrative analysis of multiple cancer prognosis datasets
    Liu, Jin
    Huang, Jian
    Xie, Yang
    Ma, Shuangge
    [J]. GENETICS RESEARCH, 2013, 95 (2-3) : 68 - 77
  • [6] Integrative analysis of multiple genomic variables using a hierarchical Bayesian model
    Schaefer, Martin
    Klein, Hans-Ulrich
    Schwender, Holger
    [J]. BIOINFORMATICS, 2017, 33 (20) : 3220 - 3227
  • [7] Genomic heterogeneity of multiple synchronous lung cancer
    Liu, Yu
    Zhang, Jianjun
    Li, Lin
    Yin, Guangliang
    Zhang, Jianhua
    Zheng, Shan
    Cheung, Hannah
    Wu, Ning
    Lu, Ning
    Mao, Xizeng
    Yang, Longhai
    Zhang, Jiexin
    Zhang, Li
    Seth, Sahil
    Chen, Huang
    Song, Xingzhi
    Liu, Kan
    Xie, Yongqiang
    Zhou, Lina
    Zhao, Chuanduo
    Han, Naijun
    Chen, Wenting
    Zhang, Susu
    Chen, Longyun
    Cai, Wenjun
    Li, Lin
    Shen, Miaozhong
    Xu, Ningzhi
    Cheng, Shujun
    Yang, Huanming
    Lee, J. Jack
    Correa, Arlene
    Fujimoto, Junya
    Behrens, Carmen
    Chow, Chi-Wan
    William, William N.
    Heymach, John V.
    Hong, Waun Ki
    Swisher, Stephen
    Wistuba, Ignacio I.
    Wang, Jun
    Lin, Dongmei
    Liu, Xiangyang
    Futreal, P. Andrew
    Gao, Yanning
    [J]. NATURE COMMUNICATIONS, 2016, 7
  • [8] Genomic heterogeneity of multiple synchronous lung cancer
    Yu Liu
    Jianjun Zhang
    Lin Li
    Guangliang Yin
    Jianhua Zhang
    Shan Zheng
    Hannah Cheung
    Ning Wu
    Ning Lu
    Xizeng Mao
    Longhai Yang
    Jiexin Zhang
    Li Zhang
    Sahil Seth
    Huang Chen
    Xingzhi Song
    Kan Liu
    Yongqiang Xie
    Lina Zhou
    Chuanduo Zhao
    Naijun Han
    Wenting Chen
    Susu Zhang
    Longyun Chen
    Wenjun Cai
    Lin Li
    Miaozhong Shen
    Ningzhi Xu
    Shujun Cheng
    Huanming Yang
    J. Jack Lee
    Arlene Correa
    Junya Fujimoto
    Carmen Behrens
    Chi-Wan Chow
    William N. William
    John V. Heymach
    Waun Ki Hong
    Stephen Swisher
    Ignacio I. Wistuba
    Jun Wang
    Dongmei Lin
    Xiangyang Liu
    P. Andrew Futreal
    Yanning Gao
    [J]. Nature Communications, 7
  • [9] Integrative classification and analysis of multiple arrayCGH datasets with probe alignment
    Ze Tian
    Rui Kuang
    [J]. BIOINFORMATICS, 2010, 26 (18) : 2313 - 2320
  • [10] Penalized integrative semiparametric interaction analysis for multiple genetic datasets
    Li, Yang
    Li, Rong
    Lin, Cunjie
    Qin, Yichen
    Ma, Shuangge
    [J]. STATISTICS IN MEDICINE, 2019, 38 (17) : 3221 - 3242