Identification of seven-gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi-omics data analysis

被引:9
|
作者
Zhang, Surong [1 ]
Zeng, Xueni [1 ,2 ]
Lin, Shaona [2 ]
Liang, Minchao [3 ]
Huang, Huaxing [2 ]
机构
[1] Guangzhou Med Univ, Affiliated Hosp 2, Dept Infect Dis, Guangzhou, Peoples R China
[2] Guangzhou Med Univ, Affiliated Hosp 2, Dept Pulm & Crit Care Med, Guangzhou 440100, Peoples R China
[3] Shenzhen Haplox Biotechnol Co Ltd, Dept Med, Shenzhen, Peoples R China
关键词
bioinformatics; CNV; lung adenocarcinoma; prognostic markers; TCGA; GENE-EXPRESSION; CANCER; CLASSIFICATION; METHYLATION; COLOPRINT; IMPACT;
D O I
10.1002/jcla.24190
中图分类号
R446 [实验室诊断]; R-33 [实验医学、医学实验];
学科分类号
1001 ;
摘要
Background The mechanism of cancer occurrence and development could be understood with multi-omics data analysis. Discovering genetic markers is highly necessary for predicting clinical outcome of lung adenocarcinoma (LUAD). Methods Clinical follow-up information, copy number variation (CNV) data, single nucleotide polymorphism (SNP), and RNA-Seq were acquired from The Cancer Genome Atlas (TCGA). To obtain robust biomarkers, prognostic-related genes, genes with SNP variation, and copy number differential genes in the training set were selected and further subjected to feature selection using random forests. Finally, a gene-based prediction model for LUAD was validated in validation datasets. Results The study filtered 2071 prognostic-related genes and 230 genomic variants, 1878 copy deletions, and 438 significant mutations. 218 candidate genes were screened through integrating genomic variation genes and prognosis-related genes. 7 characteristic genes (RHOV, CSMD3, FBN2, MAGEL2, SMIM4, BCKDHB, and GANC) were identified by random forest feature selection, and many genes were found to be tumor progression-related. A 7-gene signature constructed by Cox regression analysis was an independent prognostic factor for LUAD patients, and at the same time a risk factor in the test set, external validation set, and training set. Noticeably, the 5-year AUC of survival in the validation set and training set was all > 0.67. Similar results were obtained from multi-omics validation datasets. Conclusions The study builds a novel 7-gene signature as a prognostic marker for the survival prediction of patients with LUAD. The current findings provided a set of new prognostic and diagnostic biomarkers and therapeutic targets.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Progressively Helical Multi-Omics Data Fusion GCN and Its Application in Lung Adenocarcinoma
    Zhu, Junxuan
    Zhang, Jinhan
    Wang, Liyan
    Huang, Hao
    Zhang, Zhibo
    Song, Kai
    Zhang, Xiaofei
    IEEE ACCESS, 2023, 11 : 73568 - 73582
  • [32] Phenotype profiling of tumor microenvironment in EGFR mutant lung adenocarcinoma with multi-omics data
    Lee, You Won
    Lee, Eun Ji
    Oh, Seung Yeon
    Pyo, Kyoung-Ho
    Heo, Seong Gu
    Park, YoungJoon
    Choi, Su-Jin
    Lim, Kyumin
    Lee, Ju-Hyeon
    Kim, Jae Hwan
    Lee, Jii Bum
    Lee, Ji Yoon
    Lim, Sun Min
    Kim, Chang Gon
    Hong, Min Hee
    Yun, Mi Ran
    Cho, Byoung Chul
    CANCER RESEARCH, 2023, 83 (07)
  • [33] Geometric graph neural networks on multi-omics data to predict cancer survival outcomes
    Zhu, Jiening
    Oh, Jung Hun
    Simhal, Anish K.
    Elkin, Rena
    Norton, Larry
    Deasy, Joseph O.
    Tannenbaum, Allen
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 163
  • [34] MONGKIE: an integrated tool for network analysis and visualization for multi-omics data
    Jang, Yeongjun
    Yu, Namhee
    Seo, Jihae
    Kim, Sun
    Lee, Sanghyuk
    BIOLOGY DIRECT, 2016, 11
  • [35] MONGKIE: an integrated tool for network analysis and visualization for multi-omics data
    Yeongjun Jang
    Namhee Yu
    Jihae Seo
    Sun Kim
    Sanghyuk Lee
    Biology Direct, 11
  • [36] Deep Learning for Integrated Analysis of Insulin Resistance with Multi-Omics Data
    Huang, Eunchong
    Kim, Sarah
    Ahn, TaeJin
    JOURNAL OF PERSONALIZED MEDICINE, 2021, 11 (02): : 1 - 14
  • [37] Immunomodulatory Factor TIM3 of Cytolytic Active Genes Affected the Survival and Prognosis of Lung Adenocarcinoma Patients by Multi-Omics Analysis
    Wu, Liusheng
    Zhong, Yanfeng
    Wu, Dingwang
    Xu, Pengcheng
    Ruan, Xin
    Yan, Jun
    Liu, Jixian
    Li, Xiaoqiang
    BIOMEDICINES, 2022, 10 (09)
  • [38] FAIR Data Cube, a FAIR data infrastructure for integrated multi-omics data analysis
    Xiaofeng Liao
    Thomas H.A. Ederveen
    Anna Niehues
    Casper de Visser
    Junda Huang
    Firdaws Badmus
    Cenna Doornbos
    Yuliia Orlova
    Purva Kulkarni
    K. Joeri van der Velde
    Morris A. Swertz
    Martin Brandt
    Alain J. van Gool
    Peter A. C. ’t Hoen
    Journal of Biomedical Semantics, 15 (1)
  • [39] Functional and Prognostic Roles of Butyrophilin Genes in Lung Adenocarcinoma: Multi-Omics Integration and Survival Analyses
    Qi X.
    Chen S.
    Zuo J.
    Yan D.
    Chen J.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2021, 50 (06): : 828 - 836
  • [40] Multi-Omics Analysis Reveals a HIF Network and Hub Gene EPAS1 Associated with Lung Adenocarcinoma
    Wang, Zhaoxi
    Wei, Yongyue
    Zhang, Ruyang
    Su, Li
    Gogarten, Stephanie M.
    Liu, Geoffrey
    Brennan, Paul
    Field, John K.
    McKay, James D.
    Lissowska, Jolanta
    Swiatkowska, Beata
    Janout, Vladimir
    Bolca, Ciprian
    Kontic, Milica
    Scelo, Ghislaine
    Zaridze, David
    Laurie, Cathy C.
    Doheny, Kimberly F.
    Pugh, Elizabeth K.
    Marosy, Beth A.
    Hetrick, Kurt N.
    Xiao, Xiangjun
    Pikielny, Claudio
    Hung, Rayjean J.
    Amos, Christopher I.
    Lin, Xihong
    Christiani, David C.
    EBIOMEDICINE, 2018, 32 : 93 - 101