Analyzing omics data based on sample network

被引:1
|
作者
Sheng, Meizhen [1 ]
Qi, Yanpeng [1 ]
Gao, Zhenbo [1 ]
Lin, Xiaohui [1 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, 2 Linggong Rd, Dalian 116024, Liaoning, Peoples R China
关键词
Sample networks; feature selection; omics data analysis; EXPRESSION; CANCER; CLASSIFICATION; SELECTION; MICRORNA;
D O I
10.1142/S0219720024500021
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Identifying valuable features from complex omics data is of great significance for disease diagnosis study. This paper proposes a new feature selection algorithm based on sample network (FS-SN) to mine important information from omics data. The sample network is constructed according to the sample neighbor relationship at the molecular (feature) expression level, and the distinguishing ability of the feature is evaluated based on the topology of the sample network. The sample network established on a feature with a strong discriminating ability tends to have many edges between the same group samples and few edges between the different group samples. At the same time, FS-SN removes redundant features according to the gravitational interaction between features. To show the validation of FS-SN, it was compared on ten public datasets with ERGS, mRMR, ReliefF, ATSD-DN, and INDEED which are efficient in omics data analysis. Experimental results show that FS-SN performed better than the compared methods in accuracy, sensitivity and specificity in most cases. Hence, FS-SN making use of the topology of the sample network is effective for analyzing omics data, it can identify key features that reflect the occurrence and development of diseases, and reveal the underlying biological mechanism.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Network-based integration of multi-omics data for prioritizing cancer genes
    Dimitrakopoulos, Christos
    Hindupur, Sravanth Kumar
    Haefliger, Luca
    Behr, Jonas
    Montazeri, Hesam
    Hall, Michael N.
    Beerenwinkel, Niko
    BIOINFORMATICS, 2018, 34 (14) : 2441 - 2448
  • [32] Genomic prediction with NetGP based on gene network and multi-omics data in plants
    Zhao, Longyang
    Tang, Ping
    Luo, Jinjing
    Liu, Jianxiang
    Peng, Xin
    Shen, Mengyuan
    Wang, Chengrui
    Zhao, Junliang
    Zhou, Degui
    Fan, Zhilan
    Chen, Yibo
    Wang, Runfeng
    Tang, Xiaoyan
    Xu, Zhi
    Liu, Qi
    PLANT BIOTECHNOLOGY JOURNAL, 2025, 23 (04) : 1190 - 1201
  • [33] An omics data analysis method based on feature linear relationship and graph convolutional network
    Zhang, Yanhui
    Lin, Xiaohui
    Gao, Zhenbo
    Wang, Tianxiang
    Dong, Kunjie
    Zhang, Jianjun
    JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 145
  • [34] Integrating Omics Data for Identifying Disease Subtypes: a Multiplex Network-based Approach
    Wang, Haiying
    Zheng, Huiru
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 581 - 586
  • [35] Analyzing concept drift and shift from sample data
    Webb, Geoffrey I.
    Lee, Loong Kuan
    Goethals, Bart
    Petitjean, Francois
    DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (05) : 1179 - 1199
  • [36] Integrating Omics Data With a Multiplex Network-Based Approach for the Identification of Cancer Subtypes
    Wang, Haiying
    Zheng, Huiru
    Wang, Jianxin
    Wang, Chaoyang
    Wu, Fang-Xiang
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2016, 15 (04) : 335 - 342
  • [37] Improving stability of prediction models based on correlated omics data by using network approaches
    Tissier, Renaud
    Houwing-Duistermaat, Jeanine
    Rodriguez-Girondo, Mar
    PLOS ONE, 2018, 13 (02):
  • [38] Analyzing concept drift and shift from sample data
    Geoffrey I. Webb
    Loong Kuan Lee
    Bart Goethals
    François Petitjean
    Data Mining and Knowledge Discovery, 2018, 32 : 1179 - 1199
  • [39] MOOMIN - Mathematical explOration of 'Omics data on a MetabolIc Network
    Pusa, Taneli
    Ferrarini, Mariana Galvao
    Andrade, Ricardo
    Mary, Arnaud
    Marchetti-Spaccamela, Alberto
    Stougie, Leen
    Sagot, Marie-France
    BIOINFORMATICS, 2020, 36 (02) : 514 - 523
  • [40] Variable selection in omics data: A practical evaluation of small sample sizes
    Kirpich, Alexander
    Ainsworth, Elizabeth A.
    Wedow, Jessica M.
    Newman, Jeremy R. B.
    Michailidis, George
    McIntyre, Lauren M.
    PLOS ONE, 2018, 13 (06):