A Two-Stage Random Forest-Based Pathway Analysis Method

被引:20
|
作者
Chung, Ren-Hua [1 ,2 ]
Chen, Ying-Erh [3 ]
机构
[1] Natl Hlth Res Inst, Inst Populat Hlth Sci, Div Biostat & Bioinformat, Zhunan, Miaoli, Taiwan
[2] Univ Miami, Miller Sch Med, John P Hussman Inst Human Gen, Ctr Genet Epidemiol & Stat Genet, Miami, FL 33136 USA
[3] N Carolina State Univ, Dept Econ, Raleigh, NC 27695 USA
来源
PLOS ONE | 2012年 / 7卷 / 05期
关键词
GENE-GENE; ASSOCIATION; SUSCEPTIBILITY; POLYMORPHISMS; GENOTYPES; RISK; MDM2; SNPS;
D O I
10.1371/journal.pone.0036662
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Pathway analysis provides a powerful approach for identifying the joint effect of genes grouped into biologically-based pathways on disease. Pathway analysis is also an attractive approach for a secondary analysis of genome-wide association study (GWAS) data that may still yield new results from these valuable datasets. Most of the current pathway analysis methods focused on testing the cumulative main effects of genes in a pathway. However, for complex diseases, gene-gene interactions are expected to play a critical role in disease etiology. We extended a random forest-based method for pathway analysis by incorporating a two-stage design. We used simulations to verify that the proposed method has the correct type I error rates. We also used simulations to show that the method is more powerful than the original random forest-based pathway approach and the set-based test implemented in PLINK in the presence of gene-gene interactions. Finally, we applied the method to a breast cancer GWAS dataset and a lung cancer GWAS dataset and interesting pathways were identified that have implications for breast and lung cancers.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] A random forest-based analysis of household survey data to infer insights on digital inequality
    Nischal Regmi
    [J]. Iran Journal of Computer Science, 2023, 6 (4) : 333 - 344
  • [42] A Quantile Regression Random Forest-Based Short-Term Load Probabilistic Forecasting Method
    Dang, Sanlei
    Peng, Long
    Zhao, Jingming
    Li, Jiajie
    Kong, Zhengmin
    [J]. ENERGIES, 2022, 15 (02)
  • [43] PCirc: random forest-based plant circRNA identification software
    Yin, Shuwei
    Tian, Xiao
    Zhang, Jingjing
    Sun, Peisen
    Li, Guanglin
    [J]. BMC BIOINFORMATICS, 2021, 22 (01)
  • [44] Prostatectomy by the two-stage method
    Irwin, WK
    [J]. BRITISH MEDICAL JOURNAL, 1937, 1937 : 639 - 640
  • [45] Prostatectomy by the Two-stage Method
    Galbraith, WW
    [J]. BRITISH MEDICAL JOURNAL, 1937, 1937 : 472 - 473
  • [46] Prostatectomy by the two-stage method
    Irwin, WK
    [J]. BRITISH MEDICAL JOURNAL, 1937, 1937 : 528 - 529
  • [47] Random forest-based robust classification for lithographic hotspot detection
    Dewar, Rohit
    Barai, Samit
    Kumar, Pardeep
    Srinivasan, Babji
    Mohapatra, Nihar R.
    [J]. JOURNAL OF MICRO-NANOLITHOGRAPHY MEMS AND MOEMS, 2019, 18 (02):
  • [48] Random Forest-based Approach for Classifying Customers in Social CRM
    Lamrhari, Soumaya
    Elghazi, Hamid
    El Faker, Abdellatif
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON TECHNOLOGY MANAGEMENT, OPERATIONS AND DECISIONS (ICTMOD), 2020,
  • [49] Prostatectomy by the two-stage method
    Galbraith, WW
    [J]. BRITISH MEDICAL JOURNAL, 1937, 1937 : 638 - 639
  • [50] A Random Forest-Based Leaf Classification Using Multiple Features
    Hazra, Dipankar
    Bhattacharyya, Debnath
    Kim, Tai-hoon
    [J]. Advances in Intelligent Systems and Computing, 2021, 1280 : 227 - 239