IntLIM: integration using linear models of metabolomics and gene expression data

被引:27
|
作者
Siddiqui, Jalal K. [1 ]
Baskin, Elizabeth [1 ]
Liu, Mingrui [1 ]
Cantemir-Stone, Carmen Z. [1 ]
Zhang, Bofei [1 ,5 ]
Bonneville, Russell [2 ,3 ]
McElroy, Joseph P. [4 ]
Coombes, Kevin R. [1 ]
Mathe, Ewy A. [1 ]
机构
[1] Ohio State Univ, Coll Med, Dept Biomed Informat, Columbus, OH 43210 USA
[2] Ohio State Univ, Biomed Sci Grad Program, Columbus, OH 43210 USA
[3] Ohio State Univ, Dept Internal Med, Comprehens Canc Ctr, Columbus, OH 43210 USA
[4] Ohio State Univ, Ctr Biostat, Columbus, OH 43210 USA
[5] Ohio State Univ, Biomed Engn Undegrad Program, Columbus, OH 43210 USA
来源
BMC BIOINFORMATICS | 2018年 / 19卷
关键词
Metabolomics; Transcriptomics; Linear Modeling; Integration; PROSTATE-CANCER; R PACKAGE; BIOMARKERS; BREAST; TRANSCRIPTOMICS; METAANALYSIS; METABOLISM; CHALLENGES; DISCOVERY; MIGRATION;
D O I
10.1186/s12859-018-2085-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Integration of transcriptomic and metabolomic data improves functional interpretation of disease-related metabolomic phenotypes, and facilitates discovery of putative metabolite biomarkers and gene targets. For this reason, these data are increasingly collected in large (> 100 participants) cohorts, thereby driving a need for the development of user-friendly and open-source methods/tools for their integration. Of note, clinical/translational studies typically provide snapshot (e.g. one time point) gene and metabolite profiles and, oftentimes, most metabolites measured are not identified. Thus, in these types of studies, pathway/network approaches that take into account the complexity of transcript-metabolite relationships may neither be applicable nor readily uncover novel relationships. With this in mind, we propose a simple linear modeling approach to capture disease-(or other phenotype) specific gene-metabolite associations, with the assumption that co-regulation patterns reflect functionally related genes and metabolites. Results: The proposed linear model, metabolite similar to gene + phenotype + gene: phenotype, specifically evaluates whether gene-metabolite relationships differ by phenotype, by testing whether the relationship in one phenotype is significantly different from the relationship in another phenotype (via a statistical interaction gene: phenotype p-value). Statistical interaction p-values for all possible gene-metabolite pairs are computed and significant pairs are then clustered by the directionality of associations (e.g. strong positive association in one phenotype, strong negative association in another phenotype). We implemented our approach as an R package, IntLIM, which includes a user-friendly R Shiny web interface, thereby making the integrative analyses accessible to non-computational experts. We applied IntLIM to two previously published datasets, collected in the NCI-60 cancer cell lines and in human breast tumor and non-tumor tissue, for which transcriptomic and metabolomic data are available. We demonstrate that IntLIM captures relevant tumor-specific gene-metabolite associations involved in known cancer-related pathways, including glutamine metabolism. Using IntLIM, we also uncover biologically relevant novel relationships that could be further tested experimentally. Conclusions: IntLIM provides a user-friendly, reproducible framework to integrate transcriptomic and metabolomic data and help interpret metabolomic data and uncover novel gene-metabolite relationships. The IntLIM R package is publicly available in GitHub (https://github. com/mathelab/IntLIM) and includes a user-friendly web application, vignettes, sample data and data/code to reproduce results.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] IntLIM: integration using linear models of metabolomics and gene expression data
    Jalal K. Siddiqui
    Elizabeth Baskin
    Mingrui Liu
    Carmen Z. Cantemir-Stone
    Bofei Zhang
    Russell Bonneville
    Joseph P. McElroy
    Kevin R. Coombes
    Ewy A. Mathé
    BMC Bioinformatics, 19
  • [2] Dynamic models for metabolomics data integration
    Lakrisenko, Polina
    Weindl, Daniel
    CURRENT OPINION IN SYSTEMS BIOLOGY, 2021, 28
  • [3] Integration of metabolomics with genomics: Metabolic gene prioritization using metabolomics data and genomic variant (CADD) scores
    Bongaerts, Michiel
    Bonte, Ramon
    Demirdas, Serwet
    Huidekoper, Hidde H.
    Langendonk, Janneke
    Wilke, Martina
    de Valk, Walter
    Blom, Henk J.
    Reinders, Marcel J. T.
    Ruijter, George J. G.
    MOLECULAR GENETICS AND METABOLISM, 2022, 136 (03) : 199 - 218
  • [4] Integration of gene expression data into genome-scale metabolic models
    Åkesson, M
    Förster, J
    Nielsen, J
    METABOLIC ENGINEERING, 2004, 6 (04) : 285 - 293
  • [5] MixMir: microRNA motif discovery from gene expression data using mixed linear models
    Diao, Liyang
    Marcais, Antoine
    Norton, Scott
    Chen, Kevin C.
    NUCLEIC ACIDS RESEARCH, 2014, 42 (17)
  • [6] Integration of biological networks and gene expression data using Cytoscape
    Cline, Melissa S.
    Smoot, Michael
    Cerami, Ethan
    Kuchinsky, Allan
    Landys, Nerius
    Workman, Chris
    Christmas, Rowan
    Avila-Campilo, Iliana
    Creech, Michael
    Gross, Benjamin
    Hanspers, Kristina
    Isserlin, Ruth
    Kelley, Ryan
    Killcoyne, Sarah
    Lotia, Samad
    Maere, Steven
    Morris, John
    Ono, Keiichiro
    Pavlovic, Vuk
    Pico, Alexander R.
    Vailaya, Aditya
    Wang, Peng-Liang
    Adler, Annette
    Conklin, Bruce R.
    Hood, Leroy
    Kuiper, Martin
    Sander, Chris
    Schmulevich, Ilya
    Schwikowski, Benno
    Warner, Guy J.
    Ideker, Trey
    Bader, Gary D.
    NATURE PROTOCOLS, 2007, 2 (10) : 2366 - 2382
  • [7] Integration of biological networks and gene expression data using Cytoscape
    Melissa S Cline
    Michael Smoot
    Ethan Cerami
    Allan Kuchinsky
    Nerius Landys
    Chris Workman
    Rowan Christmas
    Iliana Avila-Campilo
    Michael Creech
    Benjamin Gross
    Kristina Hanspers
    Ruth Isserlin
    Ryan Kelley
    Sarah Killcoyne
    Samad Lotia
    Steven Maere
    John Morris
    Keiichiro Ono
    Vuk Pavlovic
    Alexander R Pico
    Aditya Vailaya
    Peng-Liang Wang
    Annette Adler
    Bruce R Conklin
    Leroy Hood
    Martin Kuiper
    Chris Sander
    Ilya Schmulevich
    Benno Schwikowski
    Guy J Warner
    Trey Ideker
    Gary D Bader
    Nature Protocols, 2007, 2 : 2366 - 2382
  • [8] Data Integration for gene expression prediction
    Bayrak, Tuncay
    Ogul, Hasan
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,
  • [9] Cis-SNPs Set Testing and PrediXcan Analysis for Gene Expression Data using Linear Mixed Models
    Zeng, Ping
    Wang, Ting
    Huang, Shuiping
    SCIENTIFIC REPORTS, 2017, 7
  • [10] Cis-SNPs Set Testing and PrediXcan Analysis for Gene Expression Data using Linear Mixed Models
    Ping Zeng
    Ting Wang
    Shuiping Huang
    Scientific Reports, 7