Robust Regression in Environmental Modeling Based on Bayesian Additive Regression Trees

被引:0
|
作者
Cao, Taoyun [1 ,2 ]
Lu, Limin [1 ]
Jiang, Tangxing [1 ]
机构
[1] Guangdong Univ Finance & Econ, Sch Math & Stat, Guangzhou 510320, Peoples R China
[2] Guangdong Univ Finance & Econ, Big Data & Educ Stat Applicat Lab, Guangzhou 510320, Peoples R China
关键词
Robust regression; Machine learning; Bayesian nonparametric; Bayesian additive regression trees; Forecasting;
D O I
10.1007/s10666-023-09925-x
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
One challenging task in analyzing environmental data is how to achieve better prediction in presence of outliers, which is inevitable, especially in environmental modeling. The purpose of prediction is to draw valid conclusions and provide useful advices in environmental management. Robust approach is often desirable when there exist outliers. Robust nonparametric regression methods have attracted much attention from a practical point of view. As a nonparametric Bayesian approach, Bayesian additive regression trees (BART) provides a flexible regression that captures potential nonlinear relationships and complex interactions via the framework including sum-of-trees model, regularization prior and Bayesian backfitting MCMC algorithm (hereafter backfitting MCMC). When outliers are present in the data, we find out that BART has the highest prediction performance in simulation studies which are carried out in a variety of cases, compared to the well-known machine learning methods: random forest, support vector machine, extreme gradient boosting. The findings of this study demonstrate that BART approach is seen to be remarkably robust to outliers. For illustration, we also analyze two datasets which exist many underlying outliers: one from a study of forest fires related factors and the other from a study of biomass fuels.
引用
收藏
页码:31 / 43
页数:13
相关论文
共 50 条
  • [1] Robust Regression in Environmental Modeling Based on Bayesian Additive Regression Trees
    Taoyun Cao
    Limin Lu
    Tangxing Jiang
    [J]. Environmental Modeling & Assessment, 2024, 29 (1) : 31 - 43
  • [2] Bayesian additive regression trees with model trees
    Prado, Estevao B.
    Moral, Rafael A.
    Parnell, Andrew C.
    [J]. STATISTICS AND COMPUTING, 2021, 31 (03)
  • [3] Bayesian additive regression trees with model trees
    Estevão B. Prado
    Rafael A. Moral
    Andrew C. Parnell
    [J]. Statistics and Computing, 2021, 31
  • [4] Parallel Bayesian Additive Regression Trees
    Pratola, Matthew T.
    Chipman, Hugh A.
    Gattiker, James R.
    Higdon, David M.
    McCulloch, Robert
    Rust, William N.
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2014, 23 (03) : 830 - 852
  • [5] BART: BAYESIAN ADDITIVE REGRESSION TREES
    Chipman, Hugh A.
    George, Edward I.
    McCulloch, Robert E.
    [J]. ANNALS OF APPLIED STATISTICS, 2010, 4 (01): : 266 - 298
  • [6] XBART: Accelerated Bayesian Additive Regression Trees
    He, Jingyu
    Yalov, Saar
    Hahn, P. Richard
    [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [7] Particle Gibbs for Bayesian Additive Regression Trees
    Lakshminarayanan, Balaji
    Roy, Daniel M.
    Teh, Yee Whye
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 38, 2015, 38 : 553 - 561
  • [8] Partially fixed bayesian additive regression trees
    Ran, Hao
    Bai, Yang
    [J]. STATISTICAL THEORY AND RELATED FIELDS, 2024, 8 (03) : 232 - 242
  • [9] Multinomial probit Bayesian additive regression trees
    Kindo, Bereket P.
    Wang, Hao
    Pena, Edsel A.
    [J]. STAT, 2016, 5 (01): : 119 - 131
  • [10] Bayesian Additive Regression Trees using Bayesian model averaging
    Belinda Hernández
    Adrian E. Raftery
    Stephen R Pennington
    Andrew C. Parnell
    [J]. Statistics and Computing, 2018, 28 : 869 - 890