Measuring the wisdom of the crowds in network-based gene function inference

被引:15
|
作者
Verleyen, W. [1 ]
Ballouz, S. [1 ]
Gillis, J. [1 ]
机构
[1] Cold Spring Harbor Lab, Stanley Inst Cognit Genom, Woodbury, NY 11797 USA
关键词
PROTEIN FUNCTION; FUNCTION PREDICTION; WEB TOOLS; DATABASE; MOUSE; ANNOTATIONS; INTEGRATION; ALGORITHM; ATLAS;
D O I
10.1093/bioinformatics/btu715
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Network-based gene function inference methods have proliferated in recent years, but measurable progress remains elusive. We wished to better explore performance trends by controlling data and algorithm implementation, with a particular focus on the performance of aggregate predictions. Results: Hypothesizing that popular methods would perform well without hand-tuning, we used well-characterized algorithms to produce verifiably 'untweaked' results. We find that most state-of-the-art machine learning methods obtain 'gold standard' performance as measured in critical assessments in defined tasks. Across a broad range of tests, we see close alignment in algorithm performances after controlling for the underlying data being used. We find that algorithm aggregation provides only modest benefits, with a 17% increase in area under the ROC (AUROC) above the mean AUROC. In contrast, data aggregation gains are enormous with an 88% improvement in mean AUROC. Altogether, we find substantial evidence to support the view that additional algorithm development has little to offer for gene function prediction.
引用
收藏
页码:745 / 752
页数:8
相关论文
共 50 条
  • [1] Wisdom of crowds for robust gene network inference
    Daniel Marbach
    James C Costello
    Robert Küffner
    Nicole M Vega
    Robert J Prill
    Diogo M Camacho
    Kyle R Allison
    Manolis Kellis
    James J Collins
    Gustavo Stolovitzky
    [J]. Nature Methods, 2012, 9 (8) : 796 - 804
  • [2] Wisdom of crowds for robust gene network inference
    Marbach, Daniel
    Costello, James C.
    Kueffner, Robert
    Vega, Nicole M.
    Prill, Robert J.
    Camacho, Diogo M.
    Allison, Kyle R.
    Kellis, Manolis
    Collins, James J.
    Stolovitzky, Gustavo
    [J]. NATURE METHODS, 2012, 9 (08) : 796 - +
  • [3] Prediction of seed gene function in progressive diabetic neuropathy by a network-based inference method
    Li, Shan-Shan
    Zhao, Xin-Bo
    Tian, Jia-Mei
    Wang, Hao-Ren
    Wei, Tong-Huan
    [J]. EXPERIMENTAL AND THERAPEUTIC MEDICINE, 2019, 17 (05) : 4176 - 4182
  • [4] Netmes: Assessing Gene Network Inference Algorithms by Network-Based Measures
    Altay, Gokmen
    Kurt, Zeyneb
    Dehmer, Matthias
    Emmert-Streib, Frank
    [J]. EVOLUTIONARY BIOINFORMATICS, 2014, 10 : 1 - 9
  • [5] NETWORK-BASED GENE FUNCTION INFERENCE APPROACH TO PREDICT OPTIMAL GENE FUNCTIONS RELATED TO AZACITIDINE TREATED MYELODYSPLASIA
    Chen, Hao
    Miao, Jiaqing
    [J]. ACTA MEDICA MEDITERRANEA, 2019, 35 (05): : 2857 - 2861
  • [6] Network-based gene function inference method to predict optimal gene functions associated with fetal growth restriction
    Ye, Ke-Jun
    Dai, Jie
    Liu, Ling-Yun
    Peng, Meng-Jia
    [J]. MOLECULAR MEDICINE REPORTS, 2018, 18 (03) : 3003 - 3010
  • [7] Network-based methods for gene function prediction
    Chen, Qingfeng
    Li, Yongjie
    Tan, Kai
    Qiao, Yvlu
    Pan, Shirui
    Jiang, Taijiao
    Chen, Yi-Ping Phoebe
    [J]. BRIEFINGS IN FUNCTIONAL GENOMICS, 2021, 20 (04) : 249 - 257
  • [8] Prediction of Key Gene Functions Associated with Chronic Hepatitis B Utilizing Network-Based Gene Function Inference Method
    He, Wei-Li
    Gou, Wei
    Wang, Yan-Ling
    Qiao, Bing
    Zhang, Hai-Feng
    [J]. INTERNATIONAL JOURNAL OF HUMAN GENETICS, 2018, 18 (02) : 109 - 115
  • [9] Wisdom of crowds versus wisdom of linguists - measuring the semantic relatedness of words
    Zesch, Torsten
    Gurevych, Iryna
    [J]. NATURAL LANGUAGE ENGINEERING, 2010, 16 : 25 - 59
  • [10] Network dynamics of social influence in the wisdom of crowds
    Becker, Joshua
    Brackbill, Devon
    Centola, Damon
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (26) : E5070 - E5076