Network-Constrained Forest for Regularized Omics Data Classification

被引:0
|
作者
Andel, Michael [1 ]
Klema, Jiri [1 ]
Krejcik, Zdenek [2 ]
机构
[1] Czech Tech Univ, Dept Comp Sci, Tech 2, CR-16635 Prague, Czech Republic
[2] Univ Nemocnice, Inst Hematol & Blood Transfus, Dept Mol Genet, Prague, Czech Republic
关键词
GENE SELECTION; CBL; EXPRESSION; CANCER; MUTATIONS; DISCOVERY; KNOWLEDGE; PROTEINS; KINASE; NPM1;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Contemporary molecular biology deals with a wide and heterogeneous set of measurements to model and understand underlying biological processes including complex diseases. Machine learning provides a frequent approach to build such models. However, the models built solely from measured data often suffer from overfitting, as the sample size is typically much smaller than the number of measured features. In this paper, we propose a random forest-based classifier that minimizes this overfitting with the aid of prior knowledge in the form of a feature interaction network. We illustrate the proposed method in the task of disease classification based on measured mRNA and miRNA profiles complemented by the interaction network composed of the miRNA-mRNA target relations and mRNA-mRNA interactions corresponding to the interactions between their encoded proteins. We demonstrate that the proposed network-constrained forest employs prior knowledge to increase learning bias and consequently to improve classification accuracy, stability and comprehensibility of the resulting model. The experiments are carried out in the domain of myelodysplastic syndrome that we are concerned about in the long term. We validate our approach in the public domain of ovarian carcinoma, with the same data form. We believe that the idea of a network-constrained forest can straightforwardly be generalized towards arbitrary omics data with an available and non-trivial feature interaction network.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Efficient In-Memory Indexing of Network-Constrained Trajectories
    Krogh, Benjamin
    Jensen, Christian S.
    Torp, Kristian
    [J]. 24TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2016), 2016,
  • [22] Network-Constrained Production Optimization by Means of Multiple Shooting
    Silva, Thiago Lima
    Codas, Andres
    Stanko, Milan
    Camponogara, Eduardo
    Foss, Bjarne
    [J]. SPE RESERVOIR EVALUATION & ENGINEERING, 2019, 22 (02) : 709 - 733
  • [23] Index method for tracking network-constrained moving objects
    Feng, Jun
    Lu, Jiamin
    Zhu, Yuelong
    Watanabe, Toyohide
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2008, 5178 : 551 - +
  • [24] Identification of Price Zones in Network-Constrained Electricity Markets
    Moreno, Ricardo
    Pozo, David
    [J]. 2021 IEEE PES INNOVATIVE SMART GRID TECHNOLOGY EUROPE (ISGT EUROPE 2021), 2021, : 121 - 126
  • [25] Tracking network-constrained moving objects with group updates
    Chen, Jidong
    Meng, Xiaofeng
    Li, Benzhao
    Lai, Caifeng
    [J]. ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2006, 4016 : 158 - 169
  • [26] Linear structure index for network-constrained moving objects
    Wang, Qianqiu
    Nong, Ge
    Wu, Wenbo
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (05): : 6192 - 6220
  • [27] Network-Constrained Adaptive Control with a Nonlinear Tracking Function
    Sharon, Yoav
    Annaswamy, Anuradha M.
    Motto, Alexis L.
    [J]. 2012 6TH INTERNATIONAL CONFERENCE ON NETWORK GAMES, CONTROL AND OPTIMIZATION (NETGCOOP), 2012, : 116 - 120
  • [28] Linear structure index for network-constrained moving objects
    Qianqiu Wang
    Ge Nong
    Wenbo Wu
    [J]. The Journal of Supercomputing, 2024, 80 : 6192 - 6220
  • [29] Location update strategies for network-constrained moving objects
    Ding, Zhiming
    Zhou, Xiaofang
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2008, 4947 : 644 - +
  • [30] Network-constrained bidding optimization strategy for aggregators of prosumers
    Iria, Jose
    Scott, Paul
    Attarha, Ahmad
    [J]. ENERGY, 2020, 207