Tree-based ensemble methods and their applications in analytical chemistry

被引:21
|
作者
Cao, Dong-Sheng [1 ]
Xu, Qing-Song [2 ]
Zhang, Liang-Xiao [3 ]
Huang, Jian-Hua [1 ]
Liang, Yi-Zeng [1 ]
机构
[1] Cent South Univ, Res Ctr Modernizat Tradit Chinese Med, Changsha 410083, Peoples R China
[2] Cent South Univ, Sch Math & Stat, Changsha 410083, Peoples R China
[3] Chinese Acad Sci, Dalian Inst Chem Phys, Key Lab Separat Sci Analyt Chem, Dalian 116023, Peoples R China
基金
中国国家自然科学基金;
关键词
Chemometrics; Classification and regression tree (CART); Cluster analysis; Complex data; Ensemble algorithm; Kernel method; Outlier detection; Pattern analysis; Tree-based ensemble; Variable selection; MULTIVARIATE REGRESSION TREES; RANDOM FOREST; OUTLIER DETECTION; FEATURE-SELECTION; COMPOUND CLASSIFICATION; DECISION TREES; PREDICTION; ALGORITHM; TOOL; ELIMINATION;
D O I
10.1016/j.trac.2012.07.012
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Large amounts of data from high-throughput analytical instruments have generally become more and more complex, bringing a number of challenges to statistical modeling. To understand complex data further, new statistically-efficient approaches are urgently needed to: (1) select salient features from the data; (2) discard uninformative data; (3) detect outlying samples in data; (4) visualize existing patterns of the data; (5) improve the prediction accuracy of the data; and, finally, (6) feed back to the analyst understandable summaries of information from the data. We review current developments in tree-based ensemble methods to mine effectively the knowledge hidden in chemical and biology data. We report on applications of these algorithms to variable selection, outlier detection, supervised pattern analysis, cluster analysis, and tree-based kernel and ensemble learning. Through this report, we wish to inspire chemists to take greater interest in decision trees and to obtain greater benefits from using the tree-based ensemble techniques. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:158 / 167
页数:10
相关论文
共 50 条
  • [21] Tree-based methods: an application to disability probabilities
    Bauer, Marcus
    Krueger, Ralf
    Olbricht, Walter
    EUROPEAN ACTUARIAL JOURNAL, 2013, 3 (02) : 491 - 513
  • [22] Tree-Based Ensemble Learning Techniques in the Analysis of Parkinsonian Syndromes
    Gorriz, J. M.
    Ramirez, J.
    Moreno-Caballero, M.
    Martinez-Murcia, F. J.
    Ortiz, A.
    Illan, I. A.
    Segovia, F.
    Salas-Gonzalez, D.
    Gomez-Rio, M.
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS (MIUA 2017), 2017, 723 : 459 - 469
  • [23] Tree-based censored regression with applications in insurance
    Lopez, Olivier
    Milhaud, Xavier
    Therond, Pierre-E.
    ELECTRONIC JOURNAL OF STATISTICS, 2016, 10 (02): : 2685 - 2716
  • [24] Tree-based methods for individualized treatment regimes
    Laber, E. B.
    Zhao, Y. Q.
    BIOMETRIKA, 2015, 102 (03) : 501 - 514
  • [25] Tree-based methods for classifying software failures
    Francis, P
    Leon, D
    Minch, M
    Podgurski, A
    15TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, PROCEEDINGS, 2004, : 451 - 462
  • [26] Tree-based ensemble methods for predicting the module temperature of a grid-tied photovoltaic system in the desert
    Ziane, Abderrezzaq
    Dabou, Rachid
    Necaibia, Ammar
    Sahouane, Nordine
    Mostefaoui, Mohammed
    Bouraiou, Ahmed
    Khelifi, Seyfallah
    Rouabhia, Abdelkrim
    Blal, Mohamed
    INTERNATIONAL JOURNAL OF GREEN ENERGY, 2021, 18 (13) : 1430 - 1440
  • [27] Tree-based ensemble methods for sensitivity analysis of environmental models: A performance comparison with Sobol and Morris techniques
    Jaxa-Rozen, Marc
    Kwakkel, Jan
    ENVIRONMENTAL MODELLING & SOFTWARE, 2018, 107 : 245 - 266
  • [28] RADE: resource-efficient supervised anomaly detection using decision tree-based ensemble methods
    Vargaftik, Shay
    Keslassy, Isaac
    Orda, Ariel
    Ben-Itzhak, Yaniv
    MACHINE LEARNING, 2021, 110 (10) : 2835 - 2866
  • [29] RADE: resource-efficient supervised anomaly detection using decision tree-based ensemble methods
    Shay Vargaftik
    Isaac Keslassy
    Ariel Orda
    Yaniv Ben-Itzhak
    Machine Learning, 2021, 110 : 2835 - 2866
  • [30] Using Ensemble-Based Methods for Directly Estimating Causal Effects: An Investigation of Tree-Based G-Computation
    Austin, Peter C.
    MULTIVARIATE BEHAVIORAL RESEARCH, 2012, 47 (01) : 115 - 135