Joint learning improves protein abundance prediction in cancers

被引:15
|
作者
Li, Hongyang [1 ]
Siddiqui, Omer [1 ]
Zhang, Hongjiu [1 ]
Guan, Yuanfang [1 ,2 ]
机构
[1] Univ Michigan, Dept Computat Med & Bioinformat, 100 Washtenaw Ave, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Internal Med, 100 Washtenaw Ave, Ann Arbor, MI 48109 USA
关键词
Cancer; Proteomics; Transcriptomics; Machine learning; PROTEOGENOMIC CHARACTERIZATION; RNA; TOOL; DETERMINANTS; QUANTITATION; PATHWAY;
D O I
10.1186/s12915-019-0730-9
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background The classic central dogma in biology is the information flow from DNA to mRNA to protein, yet complicated regulatory mechanisms underlying protein translation often lead to weak correlations between mRNA and protein abundances. This is particularly the case in cancer samples and when evaluating the same gene across multiple samples. Results Here, we report a method for predicting proteome from transcriptome, using a training dataset provided by NCI-CPTAC and TCGA, consisting of transcriptome and proteome data from 77 breast and 105 ovarian cancer samples. First, we establish a generic model capturing the correlation between mRNA and protein abundance of a single gene. Second, we build a gene-specific model capturing the interdependencies among multiple genes in a regulatory network. Third, we create a cross-tissue model by joint learning the information of shared regulatory networks and pathways across cancer tissues. Our method ranked first in the NCI-CPTAC DREAM Proteogenomics Challenge, and the predictive performance is close to the accuracy of experimental replicates. Key functional pathways and network modules controlling the proteomic abundance in cancers were revealed, in particular metabolism-related genes. Conclusions We present a method to predict proteome from transcriptome, leveraging data from different cancer tissues to build a trans-tissue model, and suggest how to integrate information from multiple cancers to provide a foundation for further research.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Joint learning improves protein abundance prediction in cancers
    Hongyang Li
    Omer Siddiqui
    Hongjiu Zhang
    Yuanfang Guan
    [J]. BMC Biology, 17
  • [2] Protein Abundance Prediction Through Machine Learning Methods
    Ferreira, Mauricio
    Ventorim, Rafaela
    Almeida, Eduardo
    Silveira, Sabrina
    Silveira, Wendel
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2021, 433 (22)
  • [4] Learning spatial structures of proteins improves protein-protein interaction prediction
    Song, Bosheng
    Luo, Xiaoyan
    Luo, Xiaoli
    Liu, Yuansheng
    Niu, Zhangming
    Zeng, Xiangxiang
    [J]. BRIEFINGS IN BIOINFORMATICS, 2022, 23 (02)
  • [5] NEW JOINT PREDICTION ALGORITHM (Q7-JASEP) IMPROVES THE PREDICTION OF PROTEIN SECONDARY STRUCTURE
    VISWANADHAN, VN
    DENCKLA, B
    WEINSTEIN, JN
    [J]. BIOCHEMISTRY, 1991, 30 (46) : 11164 - 11172
  • [6] Machine Learning Improves Diagnosis of CNS Cancers
    Dolgin, Elie
    [J]. CANCER DISCOVERY, 2018, 8 (05) : 523 - 524
  • [7] Cross-protein transfer learning substantially improves disease variant prediction
    Jagota, Milind
    Ye, Chengzhong
    Albors, Carlos
    Rastogi, Ruchir
    Koehl, Antoine
    Ioannidis, Nilah
    Song, Yun S.
    [J]. GENOME BIOLOGY, 2023, 24 (01)
  • [8] Cross-protein transfer learning substantially improves disease variant prediction
    Milind Jagota
    Chengzhong Ye
    Carlos Albors
    Ruchir Rastogi
    Antoine Koehl
    Nilah Ioannidis
    Yun S. Song
    [J]. Genome Biology, 24
  • [9] Protein Contact Prediction by Integrating Joint Evolutionary Coupling Analysis and Supervised Learning
    Ma, Jianzhu
    Wang, Sheng
    Wang, Zhiyong
    Xu, Jinbo
    [J]. RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY (RECOMB 2015), 2015, 9029 : 218 - 221
  • [10] Protein contact prediction by integrating joint evolutionary coupling analysis and supervised learning
    Ma, Jianzhu
    Wang, Sheng
    Wang, Zhiyong
    Xu, Jinbo
    [J]. BIOINFORMATICS, 2015, 31 (21) : 3506 - 3513