Blood-based multi-tissue gene expression inference with Bayesian ridge regression

Cited by: 29
Authors
Xu, Wenjian [1]
Liu, Xuanshi [1]
Leng, Fei [1]
Li, Wei [1]
Affiliation
[1] Capital Med Univ, Beijing Childrens Hosp,Beijing Key Lab Genet Birt, Beijing Pediat Res Inst,MOE Key Lab Major Dis Chi, Genet & Birth Defects Control Ctr,Natl Ctr Childr, Beijing 100045, Peoples R China
Keywords
RNA-SEQ; PAN-CANCER; SIGNATURES; DISEASE
DOI
10.1093/bioinformatics/btaa239
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Motivation: Gene expression profiling is widely used in basic and cancer research but is still not feasible in many clinical applications because some tissues, such as brain, are difficult and unethical to collect. Gene expression in uncollected tissues can be computationally inferred from genotype data and expression quantitative trait loci. However, no existing method infers unmeasured gene expression in multiple tissues from the expression profile of a single tissue.
Results: Here, we present a Bayesian ridge regression-based method (B-GEX) that infers gene expression profiles of multiple tissues from a blood gene expression profile. For each gene in a tissue, a low-dimensional feature vector is extracted from the whole blood gene expression profile by feature selection. We used GTEx RNA-seq data from 16 tissues to train inference models that capture the cross-tissue expression correlations between each target gene in a tissue and its preselected feature genes in peripheral blood. We compared B-GEX with least squares regression, LASSO regression and ridge regression. B-GEX outperforms the other three models in most tissues in terms of mean absolute error, Pearson correlation coefficient and root-mean-squared error. Moreover, B-GEX infers the expression levels of tissue-specific genes as well as those of non-tissue-specific genes in all tissues. Unlike previous methods, which require genomic features or gene expression profiles of multiple tissues, our model requires only a whole blood expression profile as input. B-GEX helps gain insight into gene expression in uncollected tissues from more accessible blood data.
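The workflow described in the abstract (select a few blood feature genes per target gene, fit a regularized Bayesian linear model, score with mean absolute error, Pearson correlation and root-mean-squared error) can be illustrated with a minimal sketch. This is not the B-GEX implementation. It assumes synthetic data instead of GTEx, feature selection by absolute Pearson correlation, and fixed noise and weight precisions (`noise_prec`, `weight_prec`), under which the Bayesian ridge posterior mean reduces to a ridge solution. All function names here are hypothetical.

```python
import math
import random

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return 0.0 if sxx == 0 or syy == 0 else sxy / math.sqrt(sxx * syy)

def mae(y_true, y_pred):
    return sum(abs(a - b) for a, b in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true))

def select_features(blood, target, k):
    """Indices of the k blood genes with the largest |Pearson r| to the target.
    Assumption: the abstract does not say which selection criterion B-GEX uses."""
    n_genes = len(blood[0])
    scores = [abs(pearson([row[j] for row in blood], target)) for j in range(n_genes)]
    return sorted(range(n_genes), key=lambda j: -scores[j])[:k]

def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            for j in range(c, n + 1):
                m[r][j] -= f * m[c][j]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x

def fit_bayes_ridge(X, y, noise_prec=1.0, weight_prec=1.0):
    """Posterior mean of the weights for a zero-mean Gaussian prior (precision
    weight_prec) and Gaussian noise (precision noise_prec), both held fixed.
    Data are centered so the intercept is not shrunk."""
    n, d = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(d)]
    y_mean = sum(y) / n
    Xc = [[row[j] - x_mean[j] for j in range(d)] for row in X]
    yc = [v - y_mean for v in y]
    ratio = weight_prec / noise_prec  # effective ridge penalty
    A = [[sum(r[i] * r[j] for r in Xc) + (ratio if i == j else 0.0)
          for j in range(d)] for i in range(d)]
    rhs = [sum(r[i] * t for r, t in zip(Xc, yc)) for i in range(d)]
    w = solve(A, rhs)
    intercept = y_mean - sum(wi * mi for wi, mi in zip(w, x_mean))
    return w, intercept

def predict(X, w, intercept):
    return [intercept + sum(wi * xi for wi, xi in zip(w, row)) for row in X]

# Synthetic stand-in for paired blood/tissue samples (not GTEx data).
random.seed(0)
n_samples, n_genes = 200, 20
blood = [[random.gauss(0, 1) for _ in range(n_genes)] for _ in range(n_samples)]
# One target gene in one tissue, driven by blood genes 3 and 7 plus noise.
tissue = [2.0 * row[3] - 1.5 * row[7] + random.gauss(0, 0.1) for row in blood]

train_idx, test_idx = slice(0, 150), slice(150, 200)
selected = select_features(blood[train_idx], tissue[train_idx], k=2)
X_train = [[row[j] for j in selected] for row in blood[train_idx]]
X_test = [[row[j] for j in selected] for row in blood[test_idx]]
w, b0 = fit_bayes_ridge(X_train, tissue[train_idx])
pred = predict(X_test, w, b0)

print("selected blood genes:", sorted(selected))
print("MAE:", mae(tissue[test_idx], pred))
print("PCC:", pearson(tissue[test_idx], pred))
print("RMSE:", rmse(tissue[test_idx], pred))
```

In a full Bayesian ridge fit the two precisions would themselves be estimated from the data rather than fixed, and one such model would be trained per target gene per tissue.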
Pages: 3788-3794
Number of pages: 7