Selective of informative metabolites using random forests based on model population analysis

Cited by: 24
Authors
Huang, Jian-Hua [1]
Yan, Jun [1]
Wu, Qing-Hua [1]
Ferro, Miguel Duarte [1]
Yi, Lun-Zhao [1]
Lu, Hong-Mei [1]
Xu, Qing-Song [2]
Liang, Yi-Zeng [1]
Affiliations
[1] Cent South Univ, Res Ctr Modernizat Tradit Chinese Med, Changsha 410083, Hunan, Peoples R China
[2] Cent South Univ, Sch Math Sci & Comp Technol, Changsha 410083, Hunan, Peoples R China
Keywords
Random forests (RF); Model population analysis (MPA); Informative metabolite; Feature selection; GAS CHROMATOGRAPHY/MASS SPECTROMETRY; OF-BAG ESTIMATION; FATTY-ACID; PLASMA; RATS; METABOLOMICS; STRAINS; OBESITY; GC/MS;
DOI
10.1016/j.talanta.2013.07.070
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry];
Discipline classification code
070302 ; 081704 ;
Abstract
One of the main goals of metabolomics studies is to discover informative metabolites or biomarkers, which may be used to diagnose diseases and to elucidate pathology. Sophisticated feature selection approaches are required to extract the information hidden in such complex 'omics' data. In this study, a new and robust selection method is proposed that combines random forests (RF) with model population analysis (MPA) to select informative metabolites from three metabolomic datasets. According to their contribution to the classification accuracy, the metabolites were classified into three kinds: informative, non-informative, and interfering metabolites. Using the proposed method, informative metabolites were selected for the three datasets; further comparison of these metabolites between healthy and diseased groups by t-test showed that the P values for all selected metabolites were lower than 0.05. Moreover, the informative metabolites identified by the current method were shown to be correlated with the clinical outcome under investigation. The Matlab source code of MPA-RF can be freely downloaded from http://code.google.com/p/my-research-list/downloads/list. (C) 2013 Elsevier B.V. All rights reserved.
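The abstract does not spell out the procedure, so the following is only a minimal sketch of how a model-population-style analysis might sort variables into the three kinds by their contribution to classification accuracy. Everything in it is an assumption made for illustration: the function names, the synthetic data, the independent 50% inclusion probability per variable, and the fixed tolerance `tol` used in place of a statistical test. To stay dependency-free, a nearest-centroid classifier stands in for the random forest; it is not the authors' MPA-RF implementation.

```python
import random
import statistics


def make_toy_data(n_per_class=40, seed=0):
    """Synthetic two-class data (hypothetical, for illustration only)."""
    rng = random.Random(seed)
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            X.append([
                rng.gauss(3.0 * label, 1.0),  # column 0: mean depends on class
                rng.gauss(0.0, 8.0),          # column 1: large-variance noise
                rng.gauss(0.0, 0.3),          # columns 2-4: small-variance noise
                rng.gauss(0.0, 0.3),
                rng.gauss(0.0, 0.3),
            ])
            y.append(label)
    return X, y


def nearest_centroid_accuracy(X_train, y_train, X_test, y_test, cols):
    """Test accuracy of a nearest-centroid classifier using only `cols`.

    This simple classifier is a stand-in for the random forest.
    """
    centroids = {}
    for label in set(y_train):
        rows = [x for x, t in zip(X_train, y_train) if t == label]
        centroids[label] = [statistics.fmean(r[c] for r in rows) for c in cols]
    correct = 0
    for x, t in zip(X_test, y_test):
        pred = min(
            centroids,
            key=lambda lab: sum((x[c] - m) ** 2 for c, m in zip(cols, centroids[lab])),
        )
        correct += pred == t
    return correct / len(y_test)


def mpa_variable_effects(X, y, n_models=400, test_frac=0.3, seed=0):
    """Mean accuracy of sub-models that include each variable minus the
    mean accuracy of sub-models that exclude it."""
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    acc_with = [[] for _ in range(p)]
    acc_without = [[] for _ in range(p)]
    for _ in range(n_models):
        # Each variable enters the sub-model independently with probability 0.5.
        cols = [j for j in range(p) if rng.random() < 0.5]
        if not cols:
            continue  # skip empty sub-models
        idx = list(range(n))
        rng.shuffle(idx)  # fresh random train/test split per sub-model
        n_test = max(1, int(n * test_frac))
        test, train = idx[:n_test], idx[n_test:]
        acc = nearest_centroid_accuracy(
            [X[i] for i in train], [y[i] for i in train],
            [X[i] for i in test], [y[i] for i in test],
            cols,
        )
        for j in range(p):
            (acc_with[j] if j in cols else acc_without[j]).append(acc)
    return [statistics.fmean(acc_with[j]) - statistics.fmean(acc_without[j])
            for j in range(p)]


def label_variables(effects, tol=0.05):
    """Map accuracy differences to the three kinds named in the abstract."""
    labels = []
    for d in effects:
        if d > tol:
            labels.append("informative")
        elif d < -tol:
            labels.append("interfering")
        else:
            labels.append("non-informative")
    return labels


if __name__ == "__main__":
    X, y = make_toy_data(seed=1)
    effects = mpa_variable_effects(X, y, seed=2)
    for j, (d, lab) in enumerate(zip(effects, label_variables(effects))):
        print(f"variable {j}: accuracy difference {d:+.3f} -> {lab}")
```

In this toy setup the class-dependent column should raise accuracy when included, while the large-variance noise column should lower it because it dominates the distance computation. A real analysis would replace the fixed tolerance with a significance test on the two accuracy distributions and the stand-in classifier with a random forest.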
Pages: 549 - 555
Page count: 7
Related papers
50 in total
  • [31] Bayesian population analysis of a harmonized physiologically based pharmacokinetic model of trichloroethylene and its metabolites
    Hack, C. Eric
    Chiu, Weihsueh A.
    Zhao, Q. Jay
    Clewell, Harvey J.
    [J]. REGULATORY TOXICOLOGY AND PHARMACOLOGY, 2006, 46 (01) : 63 - 83
  • [32] Classification of Soil Types Using Geographic Object-Based Image Analysis and Random Forests
    Andrei DORNIK
    Lucian DRAGUT
    Petru URDEA
    [J]. Pedosphere, 2018, (06) : 913 - 925
  • [33] Rapid Permissions-based Detection and Analysis of Mobile Malware Using Random Decision Forests
    Glodek, William
    Harang, Richard
    [J]. 2013 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM 2013), 2013, : 980 - 985
  • [34] Classification of Soil Types Using Geographic Object-Based Image Analysis and Random Forests
    Dornik, Andrei
    Dragut, Lucian
    Urdea, Petru
    [J]. PEDOSPHERE, 2018, 28 (06) : 913 - 925
  • [35] Classification of Soil Types Using Geographic Object-Based Image Analysis and Random Forests
    Andrei DORNIK
    Lucian DRAGUT
    Petru URDEA
    [J]. Pedosphere, 2018, 28 (06) : 913 - 925
  • [36] Example-based feature tweaking using random forests
    Lindgren, Tony
    Papapetrou, Panagiotis
    Samsten, Isak
    Asker, Lars
    [J]. 2019 IEEE 20TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2019), 2019, : 53 - 60
  • [37] Influence Analysis and Prediction of ESDD and NSDD Based on Random Forests
    Ren, Ang
    Li, Qingquan
    Xiao, Huaishuo
    [J]. ENERGIES, 2017, 10 (07)
  • [38] Improved population mapping for China using remotely sensed and points-of-interest data within a random forests model
    Ye, Tingting
    Zhao, Naizhuo
    Yang, Xuchao
    Ouyang, Zutao
    Liu, Xiaoping
    Chen, Qian
    Hu, Kejia
    Yue, Wenze
    Qi, Jiaguo
    Li, Zhansheng
    Jia, Peng
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2019, 658 : 936 - 946
  • [39] Causal Random Forests Model Using Instrumental Variable Quantile Regression
    Chen, Jau-er
    Hsiang, Chen-Wei
    [J]. ECONOMETRICS, 2019, 7 (04)
  • [40] Estimation of retinal vessel caliber using model fitting and random forests
    Araujo, Teresa
    Mendonca, Ana Maria
    Campilho, Aurelio
    [J]. MEDICAL IMAGING 2017: COMPUTER-AIDED DIAGNOSIS, 2017, 10134