Causal graph-based analysis of genome-wide association data in rheumatoid arthritis

Cited by: 15
Authors
Alekseyenko, Alexander V. [1 ,2 ]
Lytkin, Nikita I. [1 ]
Ai, Jizhou [1 ]
Ding, Bo [3 ]
Padyukov, Leonid [4 ,5 ]
Aliferis, Constantin F. [1 ,6 ,7 ]
Statnikov, Alexander [1 ,2 ]
Affiliations
[1] NYU, Sch Med, Ctr Hlth Informat & Bioinformat, New York, NY 10016 USA
[2] NYU, Sch Med, Dept Med, New York, NY 10016 USA
[3] Karolinska Inst, Inst Environm Med, SE-17177 Stockholm, Sweden
[4] Karolinska Inst, Dept Med, Rheumatol Unit, SE-17176 Stockholm, Sweden
[5] Karolinska Univ Hosp Solna, SE-17176 Stockholm, Sweden
[6] NYU, Sch Med, Dept Pathol, New York, NY 10016 USA
[7] Vanderbilt Univ, Dept Biostat, Nashville, TN 37232 USA
Funding
Swedish Research Council; US National Institutes of Health;
Keywords
MARKOV BLANKET INDUCTION; MOLECULAR SIGNATURE; FEATURE-SELECTION; LOCAL CAUSAL; MICROARRAY; DISCOVERY; DIAGNOSIS;
DOI
10.1186/1745-6150-6-25
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Background: GWAS owe their popularity to the expectation that they will make a major impact on diagnosis, prognosis and management of disease by uncovering the genetics underlying clinical phenotypes. The dominant paradigm in GWAS data analysis so far relies extensively on methods that emphasize the contribution of individual SNPs to statistical association with phenotypes. Multivariate methods, however, can extract more information by considering associations of multiple SNPs simultaneously. Recent advances in other genomics domains pinpoint multivariate causal graph-based inference as a promising principled analysis framework for high-throughput data. Designed to discover biomarkers in the local causal pathway of the phenotype, these methods lead to accurate and highly parsimonious multivariate predictive models. In this paper, we investigate the applicability of the causal graph-based method TIE* to the analysis of GWAS data. To test the utility of TIE*, we focus on anti-CCP positive rheumatoid arthritis (RA) GWAS datasets, where there is a general consensus in the community about the major genetic determinants of the disease.
Results: Application of TIE* to the North American Rheumatoid Arthritis Cohort (NARAC) GWAS data results in six SNPs, mostly from the MHC locus. Using these SNPs we develop two predictive models that can classify cases and disease-free controls with an accuracy of 0.81 area under the ROC curve, as verified in independent testing data from the same cohort. The predictive performance of these models generalizes reasonably well to Swedish subjects from the closely related but not identical Epidemiological Investigation of Rheumatoid Arthritis (EIRA) cohort, with 0.71-0.78 area under the ROC curve. Moreover, the SNPs identified by the TIE* method render many other previously known SNP associations conditionally independent of the phenotype.
Conclusions: Our experiments demonstrate that application of TIE* captures the maximum amount of genetic information about RA in the data and recapitulates the major consensus findings about the genetic factors of this disease. In addition, TIE* yields reproducible markers and signatures of RA. This suggests that a principled multivariate causal and predictive framework for GWAS analysis empowers the community with a new tool for high-quality and more efficient discovery.
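The abstract reports predictive performance as area under the ROC curve (0.81 on NARAC test data, 0.71-0.78 on EIRA). As a reading aid, the following is a minimal sketch of how that metric can be computed from case/control labels and model scores. It is not the authors' code: the function name `roc_auc` and the toy labels and scores are hypothetical, chosen only to illustrate the definition.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via the Mann-Whitney U statistic.

    Equals the probability that a randomly chosen case receives a higher
    score than a randomly chosen control, counting ties as one half.
    """
    cases = [s for y, s in zip(labels, scores) if y == 1]
    controls = [s for y, s in zip(labels, scores) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for c in cases:
        for k in controls:
            if c > k:
                wins += 1.0   # case ranked above control
            elif c == k:
                wins += 0.5   # tie counts as half
    return wins / (len(cases) * len(controls))


# Hypothetical toy data (not from the paper): 1 = case, 0 = control.
toy_labels = [1, 1, 1, 0, 0, 0]
toy_scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2]
toy_auc = roc_auc(toy_labels, toy_scores)
print(f"toy AUC = {toy_auc:.3f}")
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation of cases from controls, which gives a scale for reading the values reported above.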
Pages: 13
Related papers
50 in total
  • [1] Causal graph-based analysis of genome-wide association data in rheumatoid arthritis
    Alexander V Alekseyenko
    Nikita I Lytkin
    Jizhou Ai
    Bo Ding
    Leonid Padyukov
    Constantin F Aliferis
    Alexander Statnikov
    [J]. Biology Direct, 6
  • [2] An integrated genome-wide association analysis on rheumatoid arthritis data
    Jun Zhang
    Xiaofeng Zhu
    Richard S Cooper
    [J]. BMC Proceedings, 1 (Suppl 1)
  • [3] Genome-wide association analysis of rheumatoid arthritis data via haplotype sharing
    Andrew S Allen
    Glen A Satten
    [J]. BMC Proceedings, 3 (Suppl 7)
  • [4] Pathway analysis of genome-wide association studies on rheumatoid arthritis
    Song, G. G.
    Bae, S. -C.
    Lee, Y. H.
    [J]. CLINICAL AND EXPERIMENTAL RHEUMATOLOGY, 2013, 31 (04) : 566 - 574
  • [5] Pathway Analysis of Genome-Wide Association Studies On Rheumatoid Arthritis
    Lee, Young Ho
    Choi, Sung Jae
    Ji, Jong Dae
    Song, Gwan Gyu
    [J]. ARTHRITIS AND RHEUMATISM, 2012, 64 (10) : S179 - S179
  • [6] Application of imputation methods to the analysis of rheumatoid arthritis data in genome-wide association studies
    Douglas K Childers
    Guolian Kang
    Nianjun Liu
    Guimin Gao
    Kui Zhang
    [J]. BMC Proceedings, 3 (Suppl 7)
  • [7] Genome-wide pathway analysis of genome-wide association studies on systemic lupus erythematosus and rheumatoid arthritis
    Young Ho Lee
    Sang-Cheol Bae
    Sung Jae Choi
    Jong Dae Ji
    Gwan Gyu Song
    [J]. Molecular Biology Reports, 2012, 39 : 10627 - 10635
  • [8] Genome-wide pathway analysis of genome-wide association studies on systemic lupus erythematosus and rheumatoid arthritis
    Lee, Young Ho
    Bae, Sang-Cheol
    Choi, Sung Jae
    Ji, Jong Dae
    Song, Gwan Gyu
    [J]. MOLECULAR BIOLOGY REPORTS, 2012, 39 (12) : 10627 - 10635
  • [9] Locus category based analysis of a large genome-wide association study of rheumatoid arthritis
    Freudenberg, Jan
    Lee, Annette T.
    Siminovitch, Katherine A.
    Amos, Christopher I.
    Ballard, David
    Li, Wentian
    Gregersen, Peter K.
    [J]. HUMAN MOLECULAR GENETICS, 2010, 19 (19) : 3863 - 3872
  • [10] Genome-wide linkage and association analysis of rheumatoid arthritis in a Canadian population
    Zhi Wei
    Mingyao Li
    [J]. BMC Proceedings, 1 (Suppl 1)