Partial Least Squares Discriminant Analysis and Bayesian Networks for Metabolomic Prediction of Childhood Asthma

被引:22
|
作者
Kelly, Rachel S. [1 ,2 ]
McGeachie, Michael J. [1 ,2 ]
Lee-Sarwar, Kathleen A. [1 ,2 ,3 ]
Kachroo, Priyadarshini [1 ,2 ]
Chu, Su H. [1 ,2 ]
Virkud, Yamini V. [1 ,4 ]
Huang, Mengna [1 ,2 ]
Litonjua, Augusto A. [1 ,2 ,5 ]
Weiss, Scott T. [1 ,2 ]
Lasky-Su, Jessica [1 ,2 ]
机构
[1] Brigham & Womens Hosp, Channing Div Network Med, 75 Francis St, Boston, MA 02115 USA
[2] Harvard Med Sch, Boston, MA 02115 USA
[3] Brigham & Womens Hosp, Div Rheumatol Immunol & Allergy, 75 Francis St, Boston, MA 02115 USA
[4] Massachusetts Gen Hosp Children, Dept Pediat, Boston, MA 02114 USA
[5] Univ Rochester, Med Ctr, Dept Pediat, Div Pediat Pulm Med, Rochester, NY 14642 USA
来源
METABOLITES | 2018年 / 8卷 / 04期
关键词
Partial Least-Squares Discriminant analysis; Bayesian networks; asthma; arginine metabolism; overfitting; OPERATING CHARACTERISTIC CURVES; CLASSIFICATION; MICROBIOME;
D O I
10.3390/metabo8040068
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
To explore novel methods for the analysis of metabolomics data, we compared the ability of Partial Least Squares Discriminant Analysis (PLS-DA) and Bayesian networks (BN) to build predictive plasma metabolite models of age three asthma status in 411 three year olds (n = 59 cases and 352 controls) from the Vitamin D Antenatal Asthma Reduction Trial (VDAART) study. The standard PLS-DA approach had impressive accuracy for the prediction of age three asthma with an Area Under the Curve Convex Hull (AUCCH) of 81%. However, a permutation test indicated the possibility of overfitting. In contrast, a predictive Bayesian network including 42 metabolites had a significantly higher AUCCH of 92.1% (p for difference <0.001), with no evidence that this accuracy was due to overfitting. Both models provided biologically informative insights into asthma; in particular, a role for dysregulated arginine metabolism and several exogenous metabolites that deserve further investigation as potential causative agents. As the BN model outperformed the PLS-DA model in both accuracy and decreased risk of overfitting, it may therefore represent a viable alternative to typical analytical approaches for the investigation of metabolomics data.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Allergenicity prediction by partial least squares-based discriminant analysis
    Naneva, L. H.
    Dimitrov, I. D.
    Bangov, I. P.
    Doytchinova, I. A.
    BULGARIAN CHEMICAL COMMUNICATIONS, 2014, 46 (02): : 389 - 396
  • [2] Powered partial least squares discriminant analysis
    Liland, Kristian Hovde
    Indahl, Ulf Geir
    JOURNAL OF CHEMOMETRICS, 2009, 23 (1-2) : 7 - 18
  • [3] SAS® partial least squares for discriminant analysis
    Reeves, James B., III
    Delwiche, Stephen R.
    JOURNAL OF NEAR INFRARED SPECTROSCOPY, 2008, 16 (01) : 31 - 38
  • [4] Development of partial least squares regression with discriminant for software prediction
    Rajko, Robert
    Siket, Istvan
    Hegedus, Peter
    Ferenc, Rudolf
    HELIYON, 2024, 10 (15)
  • [5] Discriminant partial least squares analysis on compositional data
    Gallo, Michele
    STATISTICAL MODELLING, 2010, 10 (01) : 41 - 56
  • [6] A Joint Bayesian Framework Based on Partial Least Squares Discriminant Analysis for Finger Vein Recognition
    Zhang, Liping
    Sun, Linjun
    Li, Weijun
    Zhang, Jianming
    Cai, Weiwei
    Cheng, Chuantong
    Ning, Xin
    IEEE SENSORS JOURNAL, 2022, 22 (01) : 785 - 794
  • [7] Variable selection in discriminant partial least-squares analysis
    Alsberg, BK
    Kell, DB
    Goodacre, R
    ANALYTICAL CHEMISTRY, 1998, 70 (19) : 4126 - 4133
  • [8] Partial least squares discriminant analysis: taking the magic away
    Brereton, Richard G.
    Lloyd, Gavin R.
    JOURNAL OF CHEMOMETRICS, 2014, 28 (04) : 213 - 225
  • [9] Two approaches for discriminant partial least squares
    Sabatier, R
    Vivien, M
    Amenta, P
    BETWEEN DATA SCIENCE AND APPLIED DATA ANALYSIS, 2003, : 100 - 108
  • [10] Partial Least Square Discriminant Analysis for bankruptcy prediction
    Serrano-Cinca, Carlos
    Gutierrez-Nieto, Begona
    DECISION SUPPORT SYSTEMS, 2013, 54 (03) : 1245 - 1255