Motivation: Classification algorithms for high-dimensional biological data like gene expression profiles or metabolomic fingerprints are typically evaluated by the number of misclassifications across a test dataset. However, to judge the classification of a single case in the context of clinical diagnosis, we need to assess the uncertainties associated with that individual case rather than the average accuracy across many cases. Reliability of individual classifications can be expressed in terms of class probabilities. While classification algorithms are a well-developed area of research, the estimation of class probabilities is considerably less progressed in biology, with only a few classification algorithms that provide estimated class probabilities. Results: We compared several probability estimators in the context of classification of metabolomics profiles. Evaluation criteria included sparseness biases, calibration of the estimator, the variance of the estimator and its performance in identifying highly reliable classifications. We observed that several of them display artifacts that compromise their use in practice. Classification probabilities based on a combination of local cross-validation error rates and monotone regression prove superior in metabolomic profiling.
机构:
Tianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R ChinaTianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
Zhang, Haixiang
Zheng, Yinan
论文数: 0引用数: 0
h-index: 0
机构:
Northwestern Univ, Dept Prevent Med, Chicago, IL 60611 USATianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
Zheng, Yinan
Zhang, Zhou
论文数: 0引用数: 0
h-index: 0
机构:
Northwestern Univ, Dept Prevent Med, Chicago, IL 60611 USATianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
Zhang, Zhou
Gao, Tao
论文数: 0引用数: 0
h-index: 0
机构:
Northwestern Univ, Dept Prevent Med, Chicago, IL 60611 USATianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
Gao, Tao
Joyce, Brian
论文数: 0引用数: 0
h-index: 0
机构:
Northwestern Univ, Dept Prevent Med, Chicago, IL 60611 USATianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
Joyce, Brian
Yoon, Grace
论文数: 0引用数: 0
h-index: 0
机构:
Northwestern Univ, Dept Stat, Chicago, IL 60611 USATianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
Yoon, Grace
Zhang, Wei
论文数: 0引用数: 0
h-index: 0
机构:
Northwestern Univ, Dept Prevent Med, Chicago, IL 60611 USATianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
Zhang, Wei
Schwartz, Joel
论文数: 0引用数: 0
h-index: 0
机构:
Harvard Univ, Dept Environm Hlth, Boston, MA 02115 USATianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
Schwartz, Joel
Just, Allan
论文数: 0引用数: 0
h-index: 0
机构:
Icahn Sch Med Mt Sinai, Dept Prevent Med, New York, NY 10029 USATianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
Just, Allan
Colicino, Elena
论文数: 0引用数: 0
h-index: 0
机构:
Harvard Univ, Dept Environm Hlth, Boston, MA 02115 USATianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
Colicino, Elena
Vokonas, Pantel
论文数: 0引用数: 0
h-index: 0
机构:
Vet Affairs Boston Healthcare Syst, Boston, MA 02118 USA
Boston Univ, Sch Med, VA Normat Aging Study, Boston, MA 02118 USATianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
Vokonas, Pantel
Zhao, Lihui
论文数: 0引用数: 0
h-index: 0
机构:
Northwestern Univ, Dept Prevent Med, Chicago, IL 60611 USATianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
Zhao, Lihui
Lv, Jinchi
论文数: 0引用数: 0
h-index: 0
机构:
Univ Southern Calif, Data Sci & Operat Dept, Los Angeles, CA 90089 USATianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
Lv, Jinchi
Baccarelli, Andrea
论文数: 0引用数: 0
h-index: 0
机构:
Harvard Univ, Dept Environm Hlth, Boston, MA 02115 USATianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
Baccarelli, Andrea
Hou, Lifang
论文数: 0引用数: 0
h-index: 0
机构:
Northwestern Univ, Dept Prevent Med, Chicago, IL 60611 USATianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
Hou, Lifang
Liu, Lei
论文数: 0引用数: 0
h-index: 0
机构:
Northwestern Univ, Dept Prevent Med, Chicago, IL 60611 USATianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China