A genome-wide approach to link genotype to clinical outcome by utilizing next generation sequencing and gene chip data of 6,697 breast cancer patients

被引:65
|
作者
Pongor, Lorinc [1 ,2 ]
Kormos, Mate [1 ]
Hatzis, Christos [3 ]
Pusztai, Lajos [3 ]
Szabo, Andras [2 ]
Gyorffy, Balazs [1 ,2 ,4 ]
机构
[1] MTA TTK Lendulet Canc Biomarker Res Grp, Res Ctr Nat Sci, H-1117 Budapest, Hungary
[2] Semmelweis Univ, Dept Pediat 2, H-1085 Budapest, Hungary
[3] Yale Univ, Sch Med, Yale Comprehens Canc Ctr, New Haven, CT USA
[4] MTA SE Pediat & Nephrol Res Grp, Budapest, Hungary
来源
GENOME MEDICINE | 2015年 / 7卷
基金
匈牙利科学研究基金会;
关键词
HIGH-FREQUENCY; MUTATIONS; CONCORDANCE; INHIBITION; LANDSCAPE; KINASE;
D O I
10.1186/s13073-015-0228-1
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: The use of somatic mutations for predicting clinical outcome is difficult because a mutation can indirectly influence the function of many genes, and also because clinical follow-up is sparse in the relatively young next generation sequencing (NGS) databanks. Here we approach this problem by linking sequence databanks to well annotated gene-chip datasets, using a multigene transcriptomic fingerprint as a link between gene mutations and gene expression in breast cancer patients. Methods: The database consists of 763 NGS samples containing mutational status for 22,938 genes and RNA-seq data for 10,987 genes. The gene chip database contains 5,934 patients with 10,987 genes plus clinical characteristics. For the prediction, mutations present in a sample are first translated into a 'transcriptomic fingerprint' by running ROC analysis on mutation and RNA-seq data. Then correlation to survival is assessed by computing Cox regression for both up-and downregulated signatures. Results: According to this approach, the top driver oncogenes having a mutation prevalence over 5 % included AKT1, TRANK1, TRAPPC10, RPGR, COL6A2, RAPGEF4, ATG2B, CNTRL, NAA38, OSBPL10, POTEF, SCLT1, SUN1, VWDE, MTUS2, and PIK3CA, and the top tumor suppressor genes included PHEX, TP53, GGA3, RGS22, PXDNL, ARFGEF1, BRCA2, CHD8, GCC2, and ARMC4. The system was validated by computing correlation between RNA-seq and microarray data (r(2) = 0.73, P < 1E-16). Cross-validation using 20 genes with a prevalence of approximately 5 % confirmed analysis reproducibility. Conclusions: We established a pipeline enabling rapid clinical validation of a discovered mutation in a large breast cancer cohort. An online interface is available for evaluating any human gene mutation or combinations of maximum three such genes (http://www.g-2-o.com).
引用
收藏
页数:11
相关论文
共 20 条
  • [1] A genome-wide approach to link genotype to clinical outcome by utilizing next generation sequencing and gene chip data of 6,697 breast cancer patients
    Lőrinc Pongor
    Máté Kormos
    Christos Hatzis
    Lajos Pusztai
    András Szabó
    Balázs Győrffy
    Genome Medicine, 7
  • [2] A genome-wide approach to link genotype to clinical outcome by simultaneously utilizing next generation sequencing and gene chip data of 6,697 breast cancer patients
    Gyorffy, Balazs
    Kormos, Mate
    Pongor, Lorinc
    JOURNAL OF CLINICAL ONCOLOGY, 2015, 33 (15)
  • [3] Combination of next generation sequencing and gene chip data to link survival and genotype in breast cancer
    Gyorffy, B.
    Kormos, M.
    Pongor, L.
    EUROPEAN JOURNAL OF CANCER, 2015, 51 : S324 - S324
  • [4] Linking genotype to clinical outcome in breast cancer by combining NGS and gene chip data
    Gyorffy, Balazs
    Pongor, Lorinc
    Kormos, Mate
    CANCER RESEARCH, 2015, 75
  • [5] The Epigenomic Revolution in Breast Cancer From Single-Gene to Genome-Wide Next-Generation Approaches
    Davalos, Veronica
    Martinez-Cardus, Anna
    Esteller, Manel
    AMERICAN JOURNAL OF PATHOLOGY, 2017, 187 (10): : 2163 - 2174
  • [6] Identifying genome-wide immune gene variation underlying infectious disease in wildlife populations – a next generation sequencing approach in the gopher tortoise
    Jean P. Elbers
    Mary B. Brown
    Sabrina S. Taylor
    BMC Genomics, 19
  • [7] Identifying genome-wide immune gene variation underlying infectious disease in wildlife populations - a next generation sequencing approach in the gopher tortoise
    Elbers, Jean P.
    Brown, Mary B.
    Taylor, Sabrina S.
    BMC GENOMICS, 2018, 19
  • [8] Clinical Application of Next-Generation Sequencing in Patients With Breast Cancer: Real-World Data
    Suh, Koung Jin
    Kim, Se Hyun
    Kim, Yu Jung
    Shin, Heechul
    Kang, Eunyoung
    Kim, Eun-Kyu
    Lee, Sejoon
    Woo, Ji Won
    Na, Hee Young
    Ahn, Soomin
    Jang, Bum-Sup
    Kim, In Ah
    Park, So Yeon
    Kim, Jee Hyun
    JOURNAL OF BREAST CANCER, 2022, 25 (05) : 366 - 378
  • [9] Network Biomarkers of Bladder Cancer Based on a Genome-Wide Genetic and Epigenetic Network Derived from Next-Generation Sequencing Data
    Li, Cheng-Wei
    Chen, Bor-Sen
    DISEASE MARKERS, 2016, 2016
  • [10] A genomic approach to selecting robust and versatile SNP sets from next-generation sequencing data for genome-wide association study in citrus cultivars
    Shimizu, T.
    Kaminuma, E.
    Nonaka, K.
    Yoshioka, T.
    Goto, S.
    Matsumoto, T.
    Katayose, Y.
    Mochizuki, T.
    Tanizawa, Y.
    Toyoda, A.
    Fujiyama, A.
    Nakamura, Y.
    III INTERNATIONAL SYMPOSIUM ON CITRUS BIOTECHNOLOGY, 2016, 1135 : 23 - 32