Tree-based ensemble methods and their applications in analytical chemistry

被引:21
|
作者
Cao, Dong-Sheng [1 ]
Xu, Qing-Song [2 ]
Zhang, Liang-Xiao [3 ]
Huang, Jian-Hua [1 ]
Liang, Yi-Zeng [1 ]
机构
[1] Cent South Univ, Res Ctr Modernizat Tradit Chinese Med, Changsha 410083, Peoples R China
[2] Cent South Univ, Sch Math & Stat, Changsha 410083, Peoples R China
[3] Chinese Acad Sci, Dalian Inst Chem Phys, Key Lab Separat Sci Analyt Chem, Dalian 116023, Peoples R China
基金
中国国家自然科学基金;
关键词
Chemometrics; Classification and regression tree (CART); Cluster analysis; Complex data; Ensemble algorithm; Kernel method; Outlier detection; Pattern analysis; Tree-based ensemble; Variable selection; MULTIVARIATE REGRESSION TREES; RANDOM FOREST; OUTLIER DETECTION; FEATURE-SELECTION; COMPOUND CLASSIFICATION; DECISION TREES; PREDICTION; ALGORITHM; TOOL; ELIMINATION;
D O I
10.1016/j.trac.2012.07.012
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Large amounts of data from high-throughput analytical instruments have generally become more and more complex, bringing a number of challenges to statistical modeling. To understand complex data further, new statistically-efficient approaches are urgently needed to: (1) select salient features from the data; (2) discard uninformative data; (3) detect outlying samples in data; (4) visualize existing patterns of the data; (5) improve the prediction accuracy of the data; and, finally, (6) feed back to the analyst understandable summaries of information from the data. We review current developments in tree-based ensemble methods to mine effectively the knowledge hidden in chemical and biology data. We report on applications of these algorithms to variable selection, outlier detection, supervised pattern analysis, cluster analysis, and tree-based kernel and ensemble learning. Through this report, we wish to inspire chemists to take greater interest in decision trees and to obtain greater benefits from using the tree-based ensemble techniques. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:158 / 167
页数:10
相关论文
共 50 条
  • [31] Rationale and Applications of Survival Tree and Survival Ensemble Methods
    Yan Zhou
    John J. McArdle
    Psychometrika, 2015, 80 : 811 - 833
  • [32] Rationale and Applications of Survival Tree and Survival Ensemble Methods
    Zhou, Yan
    McArdle, John J.
    PSYCHOMETRIKA, 2015, 80 (03) : 811 - 833
  • [33] A tree-based intelligence ensemble approach for spatial prediction of potential groundwater
    Avand, Mohammadtaghi
    Janizadeh, Saeid
    Tien Bui, Dieu
    Pham, Viet Hoa
    Ngo, Phuong Thao T.
    Nhu, Viet-Ha
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2020, 13 (12) : 1408 - 1429
  • [34] Interpretable tree-based ensemble model for predicting beach water quality
    Li, Lingbo
    Qiao, Jundong
    Yu, Guan
    Wang, Leizhi
    Li, Hong-Yi
    Liao, Chen
    Zhu, Zhenduo
    WATER RESEARCH, 2022, 211
  • [35] Utilization of Tree-Based Ensemble Models for Predicting the Shear Strength of Soil
    Rabbani, Ahsan
    Muslih, Jan Afzal
    Saxena, Mukul
    Patil, Santosh Kalyanrao
    Mulay, Bharat Nandkumar
    Tiwari, Mohit
    Usha, A.
    Kumari, Sunita
    Samui, Pijush
    TRANSPORTATION INFRASTRUCTURE GEOTECHNOLOGY, 2024, 11 (04) : 2382 - 2405
  • [36] Application of decision tree-based ensemble learning in the classification of breast cancer
    Ghiasi, Mohammad M.
    Zendehboudi, Sohrab
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 128
  • [37] Tree-based Ensemble Classifier Learning for Automatic Brain Glioma Segmentation
    Amiri, Samya
    Mahjoub, Mohamed Ali
    Rekik, Islem
    NEUROCOMPUTING, 2018, 313 : 135 - 142
  • [38] Towards Explainability of Tree-Based Ensemble Models. A Critical Overview
    Sepiolo, Dominik
    Ligeza, Antoni
    NEW ADVANCES IN DEPENDABILITY OF NETWORKS AND SYSTEMS, DEPCOS-RELCOMEX 2022, 2022, 484 : 287 - 296
  • [39] Detection of Android Malware using Tree-based Ensemble Stacking Model
    Shafin, Sakib Shahriar
    Ahmed, Md Maroof
    Pranto, Mahmud Alam
    Chowdhury, Abdullahi
    2021 IEEE ASIA-PACIFIC CONFERENCE ON COMPUTER SCIENCE AND DATA ENGINEERING (CSDE), 2021,
  • [40] Comparison of regression tree-based methods in genomic selection
    Ashoori-Banaei, Sahar
    Ghafouri-Kesbi, Farhad
    Ahmadi, Ahmad
    JOURNAL OF GENETICS, 2021, 100 (02)