Personalized regression enables sample-specific pan-cancer analysis

被引:9
|
作者
Lengerich, Benjamin J. [1 ]
Aragam, Bryon [2 ]
Xing, Eric P. [1 ,2 ,3 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Machine Learning Dept, Pittsburgh, PA 15213 USA
[3] Petuum Inc, Pittsburgh, PA 15222 USA
关键词
HETEROGENEITY; EXPRESSION; ONTOLOGY;
D O I
10.1093/bioinformatics/bty250
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: In many applications, inter-sample heterogeneity is crucial to understanding the complex biological processes under study. For example, in genomic analysis of cancers, each patient in a cohort may have a different driver mutation, making it difficult or impossible to identify causal mutations from an averaged view of the entire cohort. Unfortunately, many traditional methods for genomic analysis seek to estimate a single model which is shared by all samples in a population, ignoring this inter-sample heterogeneity entirely. In order to better understand patient heterogeneity, it is necessary to develop practical, personalized statistical models. Results: To uncover this inter-sample heterogeneity, we propose a novel regularizer for achieving patient-specific personalized estimation. This regularizer operates by learning two latent distance metrics-one between personalized parameters and one between clinical covariates- and attempting to match the induced distances as closely as possible. Crucially, we do not assume these distance metrics are already known. Instead, we allow the data to dictate the structure of these latent distance metrics. Finally, we apply our method to learn patient-specific, interpretable models for a pan-cancer gene expression dataset containing samples from more than 30 distinct cancer types and find strong evidence of personalization effects between cancer types as well as between individuals. Our analysis uncovers sample-specific aberrations that are overlooked by population-level methods, suggesting a promising new path for precision analysis of complex diseases such as cancer.
引用
收藏
页码:178 / 186
页数:9
相关论文
共 50 条
  • [21] A pan-cancer analysis of prognostic genesl
    Anaya, Jordan
    Reon, Brian
    Chen, Wei-Min
    Bekiranov, Stefan
    Duna, Anindya
    PEERJ, 2016, 4
  • [22] A pan-cancer analysis of synonymous mutations
    Sharma, Yogita
    Miladi, Milad
    Dukare, Sandeep
    Boulay, Karine
    Caudron-Herger, Maiwen
    Gross, Matthias
    Backofen, Rolf
    Diederichs, Sven
    NATURE COMMUNICATIONS, 2019, 10 (1)
  • [23] Comparative pan-cancer DNA methylation analysis reveals cancer common and specific patterns
    Yang, Xiaofei
    Gao, Lin
    Zhang, Shihua
    BRIEFINGS IN BIOINFORMATICS, 2017, 18 (05) : 761 - 773
  • [24] Second call for pan-cancer analysis
    Nature Genetics, 2014, 46 : 1251 - 1251
  • [25] Taking pan-cancer analysis global
    不详
    NATURE GENETICS, 2013, 45 (11) : 1263 - 1263
  • [26] Pan-cancer analysis reveals sex-specific signatures in the tumor microenvironment
    Han, Junwei
    Yang, Yang
    Li, Xiangmei
    Wu, Jiashuo
    Sheng, Yuqi
    Qiu, Jiayue
    Wang, Qian
    Li, Ji
    He, Yalan
    Cheng, Liang
    Zhang, Yan
    MOLECULAR ONCOLOGY, 2022, 16 (11) : 2153 - 2173
  • [27] Personalized Network Modeling of the Pan-Cancer Patient and Cell Line Interactome
    Bhattacharyya, Rupam
    Ha, Min Jin
    Liu, Qingzhi
    Akbani, Rehan
    Liang, Han
    Baladandayuthapani, Veerabhadran
    JCO CLINICAL CANCER INFORMATICS, 2020, 4 : 399 - 411
  • [28] Research on Specific DNA Methylation Regulatory Mechanism Based on Pan-Cancer Analysis
    Guo, Y.
    Gu, J.
    Ge, D.
    JOURNAL OF THORACIC ONCOLOGY, 2023, 18 (11) : S193 - S193
  • [29] Cancer-specific survival after diagnosis in men versus women: A pan-cancer analysis
    He, Yan
    Su, Yonglin
    Zeng, Junsong
    Chong, Weelic
    Hu, Xiaolin
    Zhang, Yu
    Peng, Xingchen
    MEDCOMM, 2022, 3 (03):
  • [30] The Cancer Genome Atlas Pan-Cancer analysis project
    Weinstein, John N.
    Collisson, Eric A.
    Mills, Gordon B.
    Shaw, Kenna R. Mills
    Ozenberger, Brad A.
    Ellrott, Kyle
    Shmulevich, Ilya
    Sander, Chris
    Stuart, Joshua M.
    NATURE GENETICS, 2013, 45 (10) : 1113 - 1120