How can machine-learning methods assist in virtual screening for hyperuricemia? A healthcare machine-learning approach

被引：40

作者：

Ichikawa, Daisuke ^{[1
]}

Saito, Toki ^{[1
]}

Ujita, Waka ^{[1
]}

Oyama, Hiroshi ^{[1
]}

机构：

[1] Univ Tokyo, Grad Sch Med, Dep Clin Informat Engn, Div Social Med, Tokyo, Japan

来源：

JOURNAL OF BIOMEDICAL INFORMATICS | 2016年 / 64卷

关键词：

Hyperuricemia; Machine-learning; Prediction; URIC-ACID; CLASSIFICATION; PROGRAM; RISK;

D O I：

10.1016/j.jbi.2016.09.012

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Object: Our purpose was to develop a new machine-learning approach (a virtual health check-up) toward identification of those at high risk of hyperuricemia. Applying the system to general health check-ups is expected to reduce medical costs compared with administering an additional test. Methods: Data were collected during annual health check-ups performed in Japan between 2011 and 2013 (inclusive). We prepared training and test datasets from the health check-up data to build prediction models; these were composed of 43,524 and 17,789 persons, respectively. Gradient-boosting decision tree (GBDT), random forest (RF), and logistic regression (LR) approaches were trained using the training dataset and were then used to predict hyperuricemia in the test dataset. Undersampling was applied to build the prediction models to deal with the imbalanced class dataset. Results: The results showed that the RF and GBDT approaches afforded the best performances in terms of sensitivity and specificity, respectively. The area under the curve (AUC) values of the models, which reflected the total discriminative ability of the classification, were 0.796 [95% confidence interval (CI): 0.766-0.825] for the GBDT, 0.784 [95% CI: 0.752-0.815] for the RF, and 0.785 [95% CI: 0.752-0.819] for the LR approaches. No significant differences were observed between pairs of each approach. Small changes occurred in the AUCs after applying undersampling to build the models. Conclusions: We developed a virtual health check-up that predicted the development of hyperuricemia using machine-learning methods. The GBDT, RF, and LR methods had similar predictive capability. Undersampling did not remarkably improve predictive power. (C) 2016 Elsevier Inc. All rights reserved.

引用

页码：20 / 24

页数：5

共 50 条

[1] Evaluation of machine-learning methods for ligand-based virtual screening
Beining Chen
Robert F. Harrison
George Papadatos
Peter Willett
David J. Wood
Xiao Qing Lewell
Paulette Greenidge
Nikolaus Stiefl
[J]. Journal of Computer-Aided Molecular Design, 2007, 21 : 53 - 62
[2] Evaluation of machine-learning methods for ligand-based virtual screening
Chen, Beining
Harrison, Robert F.
Papadatos, George
Willett, Peter
Wood, David J.
Lewell, Xiao Qing
Greenidge, Paulette
Stiefl, Nikolaus
[J]. JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2007, 21 (1-3) : 53 - 62
[3] Machine-Learning Methods for Complex Flows
Vinuesa, Ricardo
Le Clainche, Soledad
[J]. ENERGIES, 2022, 15 (04)
[4] A Machine-Learning Approach to Time Discrimination
Hansen, Peter
[J]. 2010 IEEE NUCLEAR SCIENCE SYMPOSIUM CONFERENCE RECORD (NSS/MIC), 2010, : 2132 - 2133
[5] Can machine-learning methods really help predict suicide?
McHugh, Catherine M.
Large, Matthew M.
[J]. CURRENT OPINION IN PSYCHIATRY, 2020, 33 (04) : 369 - 374
[6] Theory Identity: A Machine-Learning Approach
Larsen, Kai R.
Hovorka, Dirk
West, Jevin
Birt, James
Pfaff, James R.
Chambers, Trevor W.
Sampedro, Zebula R.
Zager, Nick
Vanstone, Bruce
[J]. 2014 47TH HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2014, : 4639 - 4648
[7] Machine-learning in astronomy
Hobson, Michael
Graff, Philip
Feroz, Farhan
Lasenby, Anthony
[J]. STATISTICAL CHALLENGES IN 21ST CENTURY COSMOLOGY, 2015, 10 (306): : 279 - 287
[8] Machine-learning design
Changjun Zhang
[J]. Nature Energy, 2018, 3 : 535 - 535
[9] Machine-learning design
Zhang, Changjun
[J]. NATURE ENERGY, 2018, 3 (07): : 535 - 535
[10] Machine-Learning the Landscape
He, Yang-Hui
[J]. CALABI-YAU LANDSCAPE: FROM GEOMETRY, TO PHYSICS, TO MACHINE LEARNING, 2021, 2293 : 87 - 130

← 1 2 3 4 5 →