Kernel-based data fusion for gene prioritization

Cited by: 89
Authors
De Bie, Tijl
Tranchevent, Leon-Charles
Van Oeffelen, Liesbeth M. M.
Moreau, Yves
Affiliations
[1] Univ Bristol, Dept Engn Math, Bristol BS8 1TR, Avon, England
[2] Katholieke Univ Leuven, OKP Res Grp, B-3000 Louvain, Belgium
[3] Katholieke Univ Leuven, ESAT SCD, B-3001 Louvain, Belgium
DOI
10.1093/bioinformatics/btm187
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Motivation: Hunting disease genes is a problem of primary importance in biomedical research. Biologists usually approach this problem in two steps: first a set of candidate genes is identified using traditional positional cloning or high-throughput genomics techniques; second, these genes are further investigated and validated in the wet lab, one by one. To speed up discovery and limit the number of costly wet lab experiments, biologists must test the candidate genes starting with the most probable candidates. So far, biologists have relied on literature studies, extensive queries to multiple databases and hunches about expected properties of the disease gene to determine such an ordering. Recently, we have introduced the data mining tool ENDEAVOUR (Aerts et al., 2006), which performs this task automatically by relying on different genome-wide data sources, such as Gene Ontology, literature, microarray, sequence and more.
Results: In this article, we present a novel kernel method that operates in the same setting: based on a number of different views on a set of training genes, a prioritization of test genes is obtained. We furthermore provide a thorough learning theoretical analysis of the method's guaranteed performance. Finally, we apply the method to the disease data sets on which ENDEAVOUR (Aerts et al., 2006) has been benchmarked, and report a considerable improvement in empirical performance.
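As a rough illustration of the setting described in the abstract (several views of a set of training genes in, a ranking of test genes out), the Python sketch below scores each candidate by its average Gaussian-kernel similarity to the training genes, with the per-view kernels averaged uniformly. It is not the method of the paper, whose formulation is not given in this record. The function names, the uniform weighting, the Gaussian kernel and the toy data are all assumptions chosen for illustration.

```python
import numpy as np


def rbf_kernel(X, Y, gamma):
    """Gaussian kernel matrix between the rows of X and the rows of Y."""
    sq_dist = (X ** 2).sum(axis=1)[:, None] + (Y ** 2).sum(axis=1)[None, :] - 2.0 * X @ Y.T
    return np.exp(-gamma * np.maximum(sq_dist, 0.0))


def prioritize(train_views, test_views, gamma=0.5):
    """Rank test genes by mean kernel similarity to the training genes.

    train_views, test_views: lists with one array per data source (view),
    of shapes (n_train, d_v) and (n_test, d_v). Hypothetical interface.
    Returns (ranking, scores); ranking lists test indices, best first.
    """
    scores = np.zeros(test_views[0].shape[0])
    for X_train, X_test in zip(train_views, test_views):
        K = rbf_kernel(X_test, X_train, gamma)  # shape (n_test, n_train)
        scores += K.mean(axis=1)                # similarity to the training set
    scores /= len(train_views)                  # uniform fusion (an assumption)
    ranking = np.argsort(-scores)               # highest score first
    return ranking, scores


# Toy data: two views, two training genes, two test genes.
train_views = [np.array([[0.0, 0.0], [0.1, 0.0]]), np.array([[1.0], [1.1]])]
test_views = [np.array([[5.0, 5.0], [0.05, 0.0]]), np.array([[4.0], [1.05]])]
ranking, scores = prioritize(train_views, test_views)
print(ranking)  # test gene 1 resembles the training genes in both views
```

In this toy example the second test gene lies close to the training genes in both views, so it is ranked first. A learned, non-uniform weighting of the views would replace the plain average in a more careful design.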
Pages: I125-I132
Number of pages: 8
Related papers
50 in total
  • [1] Scuba: scalable kernel-based gene prioritization
    Guido Zampieri
    Dinh Van Tran
    Michele Donini
    Nicolò Navarin
    Fabio Aiolli
    Alessandro Sperduti
    Giorgio Valle
    [J]. BMC Bioinformatics, 19
  • [2] Scuba: scalable kernel-based gene prioritization
    Zampieri, Guido
    Dinh Van Tran
    Donini, Michele
    Navarin, Nicolo
    Aiolli, Fabio
    Sperduti, Alessandro
    Valle, Giorgio
    [J]. BMC BIOINFORMATICS, 2018, 19
  • [3] Gene Prioritization Through Geometric-Inspired Kernel Data Fusion
    Zakeri, Pooya
    Elshal, Sarah
    Moreau, Yves
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 1559 - 1565
  • [4] A kernel-based clustering method for gene selection with gene expression data
    Chen, Huihui
    Zhang, Yusen
    Gutman, Ivan
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 62 : 12 - 20
  • [5] Kernel-based data fusion and its application to protein function prediction in yeast
    Lanckriet, GRG
    Deng, M
    Cristianini, N
    Jordan, MI
    Noble, WS
    [J]. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2004, 2003, : 300 - 311
  • [6] On the Spectral Property of Kernel-Based Sensor Fusion Algorithms of High Dimensional Data
    Ding, Xiucai
    Wu, Hau-Tieng
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (01) : 640 - 670
  • [7] Kernel-based data fusion improves the drug-protein interaction prediction
    Wang, Yong-Cui
    Zhang, Chun-Hua
    Deng, Nai-Yang
    Wang, Yong
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2011, 35 (06) : 353 - 362
  • [8] Gene prioritization by genomic data fusion
    Van Loo, Peter
    Aerts, Stein
    Lambrechts, Diether
    Thienpont, Bernard
    Maity, Sunit
    Coessens, Bert
    De Smet, Frederik
    Tranchevent, Leon-Charles
    De Moor, Bart
    Devriendt, Koen
    Marynen, Peter
    Hassan, Bassem
    Carmeliet, Peter
    Moreau, Yves
    [J]. ANNALS OF HUMAN GENETICS, 2007, 71 : 550 - 551
  • [9] A Novel Kernel-based Gene Selection and Classification Scheme for Microarray Data
    Huang, Hsiao-Yun
    Chang, Hui-Yi
    Liu, Jeng-Fu
    [J]. 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 1679 - 1683
  • [10] Kernel-based grouping of histogram data
    Lange, Tilman
    Buhmann, Joachim M.
    [J]. MACHINE LEARNING: ECML 2007, PROCEEDINGS, 2007, 4701 : 632 - +