Integration of Multi-Omics Data for Gene Regulatory Network Inference and Application to Breast Cancer

被引:43
|
作者
Yuan, Lin [1 ]
Guo, Le-Hang [2 ]
Yuan, Chang-An [3 ]
Zhang, Youhua [4 ]
Han, Kyungsook [5 ]
Nandi, Asoke K. [6 ]
Honig, Barry [7 ]
Huang, De-Shuang [1 ]
机构
[1] Tongji Univ, Inst Machine Learning & Syst Biol, Sch Elect & Informat Engn, Caoan Rd 4800, Shanghai 201804, Peoples R China
[2] Tongji Univ, Shanghai Peoples Hosp 10, Dept Med Ultrasound, Ultrasound Res & Educ Inst,Sch Med, Yanchang Middle Rd 301, Shanghai 200072, Peoples R China
[3] Guangxi Teachers Educ Univ, Sci Comp & Intelligent Informat Proc GuangXi High, Nanning 530001, Guangxi, Peoples R China
[4] Anhui Agr Univ, Sch Informat & Comp, Changjiang West Rd 130, Hefei, Anhui, Peoples R China
[5] Engn Inha Univ, Sch Comp Sci, Incheon 22212, South Korea
[6] Brunel Univ London, Dept Elect & Comp Engn, Uxbridge UB8 3PH, Middx, England
[7] Ctr Computat Biol & Bioinformat, 1130 St Nicholas Ave,Room 815, New York, NY 10032 USA
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Biweight midcorrelation; differential correlation; nonconvex penalty; gene regulatory network; stability selection; VARIABLE SELECTION; DNA METHYLATION; PROTEIN; METHODOLOGY;
D O I
10.1109/TCBB.2018.2866836
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Underlying a cancer phenotype is a specific gene regulatory network that represents the complex regulatory relationships between genes. It remains, however, a challenge to find cancer-related gene regulatory network because of insufficient sample sizes and complex regulatory mechanisms in which gene is influenced by not only other genes but also other biological factors. With the development of high-throughput technologies and the unprecedented wealth of multi-omics data it gives us a new opportunity to design machine learning method to investigate underlying gene regulatory network. In this paper, we propose an approach, which use Biweight Midcorrelation to measure the correlation between factors and make use of Nonconvex Penalty based sparse regression for Gene Regulatory Network inference (BMNPGRN). BMNCGRN incorporates multi-omics data (including DNA methylation and copy number variation) and their interactions in gene regulatory network model. The experimental results on synthetic datasets show that BMNPGRN outperforms popular and state-of-the-art methods (inducing DCGRN, ARACNE, and CLR) under false positive control. Furthermore, we applied BMNPGRN on breast cancer (BRCA) data from The Cancer Genome Atlas database and provided gene regulatory network.
引用
收藏
页码:782 / 791
页数:10
相关论文
共 50 条
  • [1] Integration of multi-omics data for integrative gene regulatory network inference
    Zarayeneh, Neda
    Ko, Euiseong
    Oh, Jung Hun
    Suh, Sang
    Liu, Chunyu
    Gao, Jean
    Kim, Donghyun
    Kang, Mingon
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 18 (03) : 223 - 239
  • [2] Integration of single-cell multi-omics for gene regulatory network inference
    Hu, Xinlin
    Hu, Yaohua
    Wu, Fanjie
    Leung, Ricky Wai Tak
    Qin, Jing
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 : 1925 - 1938
  • [3] Multi-omics regulatory network inference in the presence of missing data
    Henao, Juan D.
    Lauber, Michael
    Azevedo, Manuel
    Grekova, Anastasiia
    Theis, Fabian
    List, Markus
    Ogris, Christoph
    Schubert, Benjamin
    [J]. BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)
  • [4] Multi-omics Data Integration and Network Inference for Biomarker Discovery in Glioma
    Coletti, Roberta
    Lopes, Marta B.
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2023, PT II, 2023, 14116 : 247 - 259
  • [5] Survey on Multi-omics, and Multi-omics Data Analysis, Integration and Application
    Shahrajabian, Mohamad Hesam
    Sun, Wenli
    [J]. CURRENT PHARMACEUTICAL ANALYSIS, 2023, 19 (04) : 267 - 281
  • [6] Gene regulatory network inference in the era of single-cell multi-omics
    Pau Badia-i-Mompel
    Lorna Wessels
    Sophia Müller-Dott
    Rémi Trimbour
    Ricardo O. Ramirez Flores
    Ricard Argelaguet
    Julio Saez-Rodriguez
    [J]. Nature Reviews Genetics, 2023, 24 : 739 - 754
  • [7] Gene regulatory network inference in the era of single-cell multi-omics
    Badia-i-Mompel, Pau
    Wessels, Lorna
    Mueller-Dott, Sophia
    Trimbour, Remi
    Flores, Ricardo Ramirez O.
    Argelaguet, Ricard
    Saez-Rodriguez, Julio
    [J]. NATURE REVIEWS GENETICS, 2023, 24 (11) : 739 - 754
  • [8] Multi-omics data integration reveals correlated regulatory features of triple negative breast cancer
    Chappell, Kevin
    Manna, Kanishka
    Washam, Charity L.
    Graw, Stefan
    Alkam, Duah
    Thompson, Matthew D.
    Zafar, Maroof Khan
    Hazeslip, Lindsey
    Randolph, Christopher
    Gies, Allen
    Bird, Jordan T.
    Byrd, Alicia K.
    Miah, Sayem
    Byrum, Stephanie D.
    [J]. MOLECULAR OMICS, 2021, 17 (05) : 677 - 691
  • [9] Improving plant gene regulatory network inference by integrative analysis of multi-omics and high resolution data sets
    Qian, Yichun
    Huang, Shao-shan Carol
    [J]. CURRENT OPINION IN SYSTEMS BIOLOGY, 2020, 22 : 8 - 15
  • [10] Optimizing network propagation for multi-omics data integration
    Charmpi, Konstantina
    Chokkalingam, Manopriya
    Johnen, Ronja
    Beyer, Andreas
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (11)