Cancer Outlier Analysis Based on Mixture Modeling of Gene Expression Data

被引:4
|
作者
Mori, Keita [1 ,2 ]
Oura, Tomonori [3 ]
Noma, Hisashi [4 ]
Matsui, Shigeyuki [1 ,4 ]
机构
[1] Grad Univ Adv Studies, Dept Stat Sci, Sch Multidisciplinary Sci, Tachikawa, Tokyo 1908562, Japan
[2] Shizuoka Canc Ctr, Clin Trial Coordinat Off, Nagaizumi, Shizuoka 4118777, Japan
[3] Eli Lilly Japan, Asia Pacific Stat Sci, Lilly Res Labs, Dev Ctr Excellence Asia Pacific,Chuo Ku, Kobe, Hyogo 6510086, Japan
[4] Inst Stat Math, Dept Data Sci, Tachikawa, Tokyo 1908562, Japan
关键词
SAMPLE-SIZE; FDR-CONTROL; FUSION;
D O I
10.1155/2013/693901
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Molecular heterogeneity of cancer, partially caused by various chromosomal aberrations or gene mutations, can yield substantial heterogeneity in gene expression profile in cancer samples. To detect cancer-related genes which are active only in a subset of cancer samples or cancer outliers, several methods have been proposed in the context of multiple testing. Such cancer outlier analyses will generally suffer from a serious lack of power, compared with the standard multiple testing setting where common activation of genes across all cancer samples is supposed. In this paper, we consider information sharing across genes and cancer samples, via a parametric normal mixture modeling of gene expression levels of cancer samples across genes after a standardization using the reference, normal sample data. A gene-based statistic for gene selection is developed on the basis of a posterior probability of cancer outlier for each cancer sample. Some efficiency improvement by using our method was demonstrated, even under settings with misspecified, heavy-tailed t-distributions. An application to a real dataset from hematologic malignancies is provided.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Outlier analysis for gene expression data
    Yan, C
    Chen, GL
    Shen, YF
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2004, 19 (01) : 13 - 21
  • [2] Outlier analysis for gene expression data
    Chao Yan
    Guo-Liang Chen
    Yi-Fei Shen
    [J]. Journal of Computer Science and Technology, 2004, 19 : 13 - 21
  • [3] Mixture modeling of microarray gene expression data
    Yang Yang
    Adam P Tashman
    Jung Yeon Lee
    Seungtai Yoon
    Wenyang Mao
    Kwangmi Ahn
    Wonkuk Kim
    Nancy R Mendell
    Derek Gordon
    Stephen J Finch
    [J]. BMC Proceedings, 1 (Suppl 1)
  • [4] Gene Selection for Cancer Clustering Analysis Based on Expression Data
    Xu, Taosheng
    Su, Ning
    Wang, Rujing
    Song, Liangtu
    [J]. PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 516 - 519
  • [5] Cancer outlier differential gene expression detection
    Wu, Baolin
    [J]. BIOSTATISTICS, 2007, 8 (03) : 566 - 575
  • [6] Mixture model on the variance for the differential analysis of gene expression data
    Delmar, P
    Robin, S
    Tronik-Le Roux, D
    Daudin, JJ
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2005, 54 : 31 - 50
  • [7] Analysis of Microarray Gene Expression Data Using a Mixture Model
    Bartolucci, Al
    Allison, David B.
    Bae, Sejong
    Singh, Karan P.
    [J]. MODSIM 2007: INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION: LAND, WATER AND ENVIRONMENTAL MANAGEMENT: INTEGRATED SYSTEMS FOR SUSTAINABILITY, 2007, : 2867 - 2869
  • [8] A mixture model approach for the analysis of microarray gene expression data
    Allison, David B.
    Gadbury, Gary L.
    Heo, Moonseong
    Fernández, José R.
    Lee, Cheol-Koo
    Prolla, Tomas A.
    Weindruch, Richard
    [J]. Computational Statistics and Data Analysis, 2002, 38 (05): : 1 - 20
  • [9] A mixture model approach for the analysis of microarray gene expression data
    Allison, DB
    Gadbury, GL
    Heo, MS
    Fernández, JR
    Lee, CK
    Prolla, TA
    Weindruch, R
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 39 (01) : 1 - 20
  • [10] Outlier sums for differential gene expression analysis
    Tibshirani, Robert
    Hastie, Trevor
    [J]. BIOSTATISTICS, 2007, 8 (01) : 2 - 8