Personalized regression enables sample-specific pan-cancer analysis

被引:9
|
作者
Lengerich, Benjamin J. [1 ]
Aragam, Bryon [2 ]
Xing, Eric P. [1 ,2 ,3 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Machine Learning Dept, Pittsburgh, PA 15213 USA
[3] Petuum Inc, Pittsburgh, PA 15222 USA
关键词
HETEROGENEITY; EXPRESSION; ONTOLOGY;
D O I
10.1093/bioinformatics/bty250
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: In many applications, inter-sample heterogeneity is crucial to understanding the complex biological processes under study. For example, in genomic analysis of cancers, each patient in a cohort may have a different driver mutation, making it difficult or impossible to identify causal mutations from an averaged view of the entire cohort. Unfortunately, many traditional methods for genomic analysis seek to estimate a single model which is shared by all samples in a population, ignoring this inter-sample heterogeneity entirely. In order to better understand patient heterogeneity, it is necessary to develop practical, personalized statistical models. Results: To uncover this inter-sample heterogeneity, we propose a novel regularizer for achieving patient-specific personalized estimation. This regularizer operates by learning two latent distance metrics-one between personalized parameters and one between clinical covariates- and attempting to match the induced distances as closely as possible. Crucially, we do not assume these distance metrics are already known. Instead, we allow the data to dictate the structure of these latent distance metrics. Finally, we apply our method to learn patient-specific, interpretable models for a pan-cancer gene expression dataset containing samples from more than 30 distinct cancer types and find strong evidence of personalization effects between cancer types as well as between individuals. Our analysis uncovers sample-specific aberrations that are overlooked by population-level methods, suggesting a promising new path for precision analysis of complex diseases such as cancer.
引用
收藏
页码:178 / 186
页数:9
相关论文
共 50 条
  • [31] The Cancer Genome Atlas Pan-Cancer analysis project
    John N Weinstein
    Eric A Collisson
    Gordon B Mills
    Kenna R Mills Shaw
    Brad A Ozenberger
    Kyle Ellrott
    Ilya Shmulevich
    Chris Sander
    Joshua M Stuart
    Nature Genetics, 2013, 45 : 1113 - 1120
  • [32] EXTRACTING SIGNIFICANT SAMPLE-SPECIFIC CANCER MUTATIONS USING THEIR PROTEIN INTERACTIONS
    Badea, Liviu
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2014, 2014, : 15 - 26
  • [33] Sample-Specific Perturbation of Gene Interactions Identifies Pancreatic Cancer Subtypes
    Wei, Ran
    Zhang, Huihui
    Cao, Jianzhong
    Qin, Dailei
    Li, Shengping
    Deng, Wuguo
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (09)
  • [34] Pan-Cancer Analysis of Postdiagnosis Exercise and Mortality
    Lavery, Jessica A.
    Boutros, Paul C.
    Scott, Jessica M.
    Tammela, Tuomas
    Moskowitz, Chaya S.
    Jones, Lee W.
    JOURNAL OF CLINICAL ONCOLOGY, 2023, 41 (32) : 4982 - +
  • [35] Pan-cancer analysis of prognostic metastatic phenotypes
    Zaorsky, Nicholas G.
    Wang, Xi
    Garrett, Sara M.
    Lehrer, Eric J.
    Lin, Christine
    DeGraff, David J.
    Spratt, Daniel E.
    Trifiletti, Daniel M.
    Kishan, Amar U.
    Showalter, Timothy N.
    Park, Henry S.
    Yang, Jonathan T.
    Chinchilli, Vernon M.
    Wang, Ming
    INTERNATIONAL JOURNAL OF CANCER, 2022, 150 (01) : 132 - 141
  • [36] Integrative analysis the characterization of peroxiredoxins in pan-cancer
    Gao, Lei
    Meng, Jialin
    Yue, Chuang
    Wu, Xingyu
    Su, Quanxin
    Wu, Hao
    Zhang, Ze
    Yu, Qinzhou
    Gao, Shenglin
    Fan, Song
    Zuo, Li
    CANCER CELL INTERNATIONAL, 2021, 21 (01)
  • [37] Pan-cancer analysis of telomere maintenance mechanisms
    Hakobyan, Meline
    Binder, Hans
    Arakelyan, Arsen
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2024, 300 (06)
  • [38] Pan-Cancer Analysis of Prognostic Metastatic Phenotypes
    Zaorsky, N. G.
    Wang, X.
    Lehrer, E. J.
    Lin, C.
    Garrett, S. M.
    Zhang, Y.
    DeGraff, D.
    Spratt, D. E.
    Trifiletti, D. M.
    Kishan, A. U.
    Showalter, T. N.
    Park, H. S. M.
    Yang, J. T.
    Wang, M.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2021, 111 (03): : S64 - S64
  • [39] Pan-cancer analysis of the metabolic reaction network
    Gatto, Francesco
    Ferreira, Raphael
    Nielsen, Jens
    METABOLIC ENGINEERING, 2020, 57 : 51 - 62
  • [40] Systematic pan-cancer analysis of tumour purity
    Dvir Aran
    Marina Sirota
    Atul J. Butte
    Nature Communications, 6