Tree-based ensemble methods and their applications in analytical chemistry

被引:21
|
作者
Cao, Dong-Sheng [1 ]
Xu, Qing-Song [2 ]
Zhang, Liang-Xiao [3 ]
Huang, Jian-Hua [1 ]
Liang, Yi-Zeng [1 ]
机构
[1] Cent South Univ, Res Ctr Modernizat Tradit Chinese Med, Changsha 410083, Peoples R China
[2] Cent South Univ, Sch Math & Stat, Changsha 410083, Peoples R China
[3] Chinese Acad Sci, Dalian Inst Chem Phys, Key Lab Separat Sci Analyt Chem, Dalian 116023, Peoples R China
基金
中国国家自然科学基金;
关键词
Chemometrics; Classification and regression tree (CART); Cluster analysis; Complex data; Ensemble algorithm; Kernel method; Outlier detection; Pattern analysis; Tree-based ensemble; Variable selection; MULTIVARIATE REGRESSION TREES; RANDOM FOREST; OUTLIER DETECTION; FEATURE-SELECTION; COMPOUND CLASSIFICATION; DECISION TREES; PREDICTION; ALGORITHM; TOOL; ELIMINATION;
D O I
10.1016/j.trac.2012.07.012
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Large amounts of data from high-throughput analytical instruments have generally become more and more complex, bringing a number of challenges to statistical modeling. To understand complex data further, new statistically-efficient approaches are urgently needed to: (1) select salient features from the data; (2) discard uninformative data; (3) detect outlying samples in data; (4) visualize existing patterns of the data; (5) improve the prediction accuracy of the data; and, finally, (6) feed back to the analyst understandable summaries of information from the data. We review current developments in tree-based ensemble methods to mine effectively the knowledge hidden in chemical and biology data. We report on applications of these algorithms to variable selection, outlier detection, supervised pattern analysis, cluster analysis, and tree-based kernel and ensemble learning. Through this report, we wish to inspire chemists to take greater interest in decision trees and to obtain greater benefits from using the tree-based ensemble techniques. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:158 / 167
页数:10
相关论文
共 50 条
  • [1] Classification of repeated measurements data using tree-based ensemble methods
    Werner Adler
    Sergej Potapov
    Berthold Lausen
    Computational Statistics, 2011, 26
  • [2] Exploring the potential of tree-based ensemble methods in solar radiation modeling
    Hassan, Muhammed A.
    Khalil, A.
    Kaseb, S.
    Kassem, M. A.
    APPLIED ENERGY, 2017, 203 : 897 - 916
  • [3] Predicting musculoskeletal disorders risk using tree-based ensemble methods
    Paraponaris, A.
    Ba, A.
    Gallic, E.
    Liance, Q.
    Michel, Pierre
    EUROPEAN JOURNAL OF PUBLIC HEALTH, 2019, 29
  • [4] Classification of repeated measurements data using tree-based ensemble methods
    Adler, Werner
    Potapov, Sergej
    Lausen, Berthold
    COMPUTATIONAL STATISTICS, 2011, 26 (02) : 355 - 369
  • [5] Evaluating the usage of tree-based ensemble methods in groundwater spring potential mapping
    Chen, Wei
    Zhao, Xia
    Tsangaratos, Paraskevas
    Shahabi, Himan
    Ilia, Ioanna
    Xue, Weifeng
    Wang, Xiaojing
    Bin Ahmad, Baharin
    JOURNAL OF HYDROLOGY, 2020, 583
  • [6] Tree-Based Ranking Methods
    Clemencon, Stephan
    Vayatis, Nicolas
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2009, 55 (09) : 4316 - 4336
  • [7] Comparison of tree-based ensemble models for regression
    Park, Sangho
    Kim, Chanmin
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2022, 29 (05) : 561 - 590
  • [8] Tree-Based Ensemble Models and Algorithms for Classification
    Tsiligaridis, J.
    2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 103 - 106
  • [9] Comparing performance of non–tree-based and tree-based association mapping methods
    Katherine L. Thompson
    David W. Fardo
    BMC Proceedings, 10 (Suppl 7)
  • [10] Comparing Predicted Historical Distributions of Tree Species Using Two Tree-based Ensemble Classification Methods
    Hanberry, Brice B.
    He, Hong S.
    Palik, Brian J.
    AMERICAN MIDLAND NATURALIST, 2012, 168 (02): : 443 - 455