SuMO-Fil: Supervised multi-omic filtering prior to performing network analysis

被引:2
|
作者
Towle-Miller, Lorin M. [1 ]
Miecznikowski, Jeffrey C. [1 ]
Zhang, Fan [1 ]
Tritchler, David L. [1 ,2 ]
机构
[1] Univ Buffalo, Dept Biostat, Buffalo, NY 14203 USA
[2] Univ Toronto, Biostat Div, Toronto, ON, Canada
来源
PLOS ONE | 2021年 / 16卷 / 08期
基金
美国国家卫生研究院;
关键词
ENDOMETRIAL CANCER; IDENTIFICATION; VARIANTS; PATHWAY;
D O I
10.1371/journal.pone.0255579
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Multi-omic analyses that integrate many high-dimensional datasets often present significant deficiencies in statistical power and require time consuming computations to execute the analytical methods. We present SuMO-Fil to remedy against these issues which is a pre-processing method for Supervised Multi-Omic Filtering that removes variables or features considered to be irrelevant noise. SuMO-Fil is intended to be performed prior to downstream analyses that detect supervised gene networks in sparse settings. We accomplish this by implementing variable filters based on low similarity across the datasets in conjunction with low similarity with the outcome. This approach can improve accuracy, as well as reduce run times for a variety of computationally expensive downstream analyses. This method has applications in a setting where the downstream analysis may include sparse canonical correlation analysis. Filtering methods specifically for cluster and network analysis are introduced and compared by simulating modular networks with known statistical properties. The SuMO-Fil method performs favorably by eliminating non-network features while maintaining important biological signal under a variety of different signal settings as compared to popular filtering techniques based on low means or low variances. We show that the speed and accuracy of methods such as supervised sparse canonical correlation are increased after using SuMO-Fil, thus greatly improving the scalability of these approaches.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] MoNET: an R package for multi-omic network analysis
    Li, Jin
    Chen, Feng
    Liang, Hong
    Yan, Jingwen
    [J]. BIOINFORMATICS, 2022, 38 (04) : 1165 - 1167
  • [2] Multi-omic network signatures of disease
    Gibbs, David L.
    Gralinski, Lisa
    Baric, Ralph S.
    McWeeney, Shannon K.
    [J]. FRONTIERS IN GENETICS, 2014, 4
  • [3] Multi-omic Pathway and Network Analysis to Identify Biomakers for Hepatocellular Carcinoma
    Barefoot, Megan E.
    Varghese, Rency S.
    Zhou, Yuan
    Di Poto, Cristina
    Ferrarini, Alessia
    Ressom, Habtom W.
    [J]. 2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 1350 - 1354
  • [4] Multi-view Factorization AutoEncoder with Network Constraints for Multi-omic Integrative Analysis
    Ma, Tianle
    Zhang, Aidong
    [J]. PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 702 - 707
  • [5] Multi-omic analyses and network biology in cardiovascular disease
    Reitz, Cristine J.
    Kuzmanov, Uros
    Gramolini, Anthony O.
    [J]. PROTEOMICS, 2023, 23 (21-22)
  • [6] Multi-omic data analysis using Galaxy
    Boekel, Jorrit
    Chilton, John M.
    Cooke, Ira R.
    Horvatovich, Peter L.
    Jagtap, Pratik D.
    Kall, Lukas
    Lehtio, Janne
    Lukasse, Pieter
    Moerland, Perry D.
    Griffin, Timothy J.
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (02) : 137 - 139
  • [7] Multi-omic data analysis using Galaxy
    Jorrit Boekel
    John M Chilton
    Ira R Cooke
    Peter L Horvatovich
    Pratik D Jagtap
    Lukas Käll
    Janne Lehtiö
    Pieter Lukasse
    Perry D Moerland
    Timothy J Griffin
    [J]. Nature Biotechnology, 2015, 33 : 137 - 139
  • [8] A guided network estimation approach using multi-omic information
    Bartzis, Georgios
    Peeters, Carel F. W.
    Ligterink, Wilco
    Van Eeuwijk, Fred A.
    [J]. BMC BIOINFORMATICS, 2024, 25 (01):
  • [9] MOSCATO: a supervised approach for analyzing multi-Omic single-Cell data
    Towle-Miller, Lorin M.
    Miecznikowski, Jeffrey C.
    [J]. BMC GENOMICS, 2022, 23 (01)
  • [10] MOSCATO: a supervised approach for analyzing multi-Omic single-Cell data
    Lorin M. Towle-Miller
    Jeffrey C. Miecznikowski
    [J]. BMC Genomics, 23