Tree-based ensemble methods and their applications in analytical chemistry

Cited by: 21
Authors
Cao, Dong-Sheng [1]
Xu, Qing-Song [2]
Zhang, Liang-Xiao [3]
Huang, Jian-Hua [1]
Liang, Yi-Zeng [1]
Affiliations
[1] Cent South Univ, Res Ctr Modernizat Tradit Chinese Med, Changsha 410083, Peoples R China
[2] Cent South Univ, Sch Math & Stat, Changsha 410083, Peoples R China
[3] Chinese Acad Sci, Dalian Inst Chem Phys, Key Lab Separat Sci Analyt Chem, Dalian 116023, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Chemometrics; Classification and regression tree (CART); Cluster analysis; Complex data; Ensemble algorithm; Kernel method; Outlier detection; Pattern analysis; Tree-based ensemble; Variable selection; MULTIVARIATE REGRESSION TREES; RANDOM FOREST; OUTLIER DETECTION; FEATURE-SELECTION; COMPOUND CLASSIFICATION; DECISION TREES; PREDICTION; ALGORITHM; TOOL; ELIMINATION;
DOI
10.1016/j.trac.2012.07.012
Chinese Library Classification number
O65 [Analytical Chemistry];
Discipline classification codes
070302 ; 081704 ;
Abstract
Large amounts of data from high-throughput analytical instruments have generally become more and more complex, bringing a number of challenges to statistical modeling. To understand complex data further, new statistically-efficient approaches are urgently needed to: (1) select salient features from the data; (2) discard uninformative data; (3) detect outlying samples in data; (4) visualize existing patterns of the data; (5) improve the prediction accuracy of the data; and, finally, (6) feed back to the analyst understandable summaries of information from the data. We review current developments in tree-based ensemble methods to mine effectively the knowledge hidden in chemical and biology data. We report on applications of these algorithms to variable selection, outlier detection, supervised pattern analysis, cluster analysis, and tree-based kernel and ensemble learning. Through this report, we wish to inspire chemists to take greater interest in decision trees and to obtain greater benefits from using the tree-based ensemble techniques. (C) 2012 Elsevier Ltd. All rights reserved.
Pages: 158-167
Number of pages: 10
Related papers
50 in total
  • [41] A novel tree-based dynamic heterogeneous ensemble method for credit scoring
    Xia, Yufei
    Zhao, Junhao
    He, Lingyun
    Li, Yinguo
    Niu, Mengyi
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 159
  • [42] Natural mortality estimation using tree-based ensemble learning models
    Liu, Chanjuan
    Zhou, Shijie
    Wang, You-Gan
    Hu, Zhihua
    ICES JOURNAL OF MARINE SCIENCE, 2020, 77 (04) : 1414 - 1426
  • [43] Evaluating Tree-based Ensemble Strategies for Imbalanced Network Attack Classification
    Soon, Hui Fern
    Amir, Amiza
    Nishizaki, Hiromitsu
    Zahri, Nik Adilah Hanin
    Kamarudin, Latifah Munirah
    Azemi, Saidatul Norlyana
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (01) : 1124 - 1134
  • [44] Evaluation of tree-based ensemble learning algorithms for building energy performance estimation
    Papadopoulos, Sokratis
    Azar, Elie
    Woon, Wei-Lee
    Kontokosta, Constantine E.
    JOURNAL OF BUILDING PERFORMANCE SIMULATION, 2018, 11 (03) : 322 - 332
  • [45] A Comparative Analysis of Tree-Based Ensemble Methods for Detecting Imminent Lane Change Maneuvers in Connected Vehicle Environments
    Mousa, Saleh R.
    Bakhit, Peter R.
    Osman, Osama A.
    Ishak, Sherif
    TRANSPORTATION RESEARCH RECORD, 2018, 2672 (42) : 268 - 279
  • [46] Toward explainable electrical load forecasting of buildings: A comparative study of tree-based ensemble methods with Shapley values
    Moon, Jihoon
    Rho, Seungmin
    Baik, Sung Wook
    SUSTAINABLE ENERGY TECHNOLOGIES AND ASSESSMENTS, 2022, 54
  • [47] SOME APPLICATIONS OF TREE-BASED MODELING TO SPEECH AND LANGUAGE
    RILEY, MD
    SPEECH AND NATURAL LANGUAGE, 1989, : 339 - 352
  • [48] Exploiting Categorical Structure Using Tree-Based Methods
    Lucena, Brian
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 2949 - 2957
  • [49] Tree-based Methods for Characterizing Tumor Density Heterogeneity
    Shoemaker, Katherine
    Hobbs, Brian P.
    Bharath, Karthik
    Ng, Chaan S.
    Baladandayuthapani, Veerabhadran
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2018 (PSB), 2018, : 216 - 227
  • [50] Tree-based methods: a useful tool for life insurance
    Olbricht, Walter
    EUROPEAN ACTUARIAL JOURNAL, 2012, 2 (01) : 129 - 147