When efficient model averaging out-performs boosting and bagging

被引:0
|
作者
Davidson, Ian [1 ]
Fan, Wei
机构
[1] SUNY Albany, Albany, NY 12222 USA
[2] IBM TJ Watson, Hawthorne, NY 10532 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Bayes optimal classifier (BOC) is an ensemble technique used extensively in the statistics literature. However, compared to other ensemble techniques such as bagging and boosting, BOC is less known and rarely used in data mining. This is partly due to BOC being perceived as being inefficient and because bagging and boosting consistently outperforms a single model, which raises the question: "Do we even need BOC in datamining?". We show that the answer to this question is "yes" by illustrating several recent efficient model averaging approximations to BOC can significantly outperform bagging and boosting in realistic situations such as extensive class label noise, sample selection bias and many-class problems. That model averaging techniques outperform bagging and boosting in these situations has not been published in the machine learning, mining or statistical communities to our knowledge.
引用
收藏
页码:478 / 486
页数:9
相关论文
共 50 条
  • [21] Efficient Model Averaging for Deep Neural Networks
    Opitz, Michael
    Possegger, Horst
    Bischof, Horst
    [J]. COMPUTER VISION - ACCV 2016, PT II, 2017, 10112 : 205 - 220
  • [22] An Efficient Ensemble Algorithm for Boosting k-Nearest Neighbors Classification Performance via Feature Bagging
    Nguyen, Huu-Hoa
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (06) : 767 - 776
  • [23] GFAP Out-Performs S100β in Detecting Traumatic Intracranial Lesions on Computed Tomography in Trauma Patients with Mild Traumatic Brain Injury and Those with Extracranial Lesions
    Papa, Linda
    Silvestri, Salvatore
    Brophy, Gretchen M.
    Giordano, Philip
    Falk, Jay L.
    Braga, Carolina F.
    Tan, Ciara N.
    Ameli, Neema J.
    Demery, Jason A.
    Dixit, Neha K.
    Mendes, Matthew E.
    Hayes, Ronald L.
    Wang, Kevin K. W.
    Robertson, Claudia S.
    [J]. JOURNAL OF NEUROTRAUMA, 2014, 31 (22) : 1815 - U11
  • [24] Quantification of Primary Tumor-Associated CD3+Lymphocytes Out-Performs Mismatch Repair Deficiency in Predicting Recurrence in Endometrioid-Type Endometrial Carcinoma
    Avila, Monica
    Fellman, Bryan
    Crumley, Suzanne
    Hudgens, Courtney
    Tetzlaff, Michael
    Broaddus, Russell
    [J]. LABORATORY INVESTIGATION, 2020, 100 (SUPPL 1) : 1014 - 1015
  • [25] Prediction model for rice husk ash concrete using AI approach: Boosting and bagging algorithms
    Amin, Muhammad Nasir
    Iftikhar, Bawar
    Khan, Kaffayatullah
    Javed, Muhammad Faisal
    AbuArab, Abdullah Mohammad
    Rehman, Muhammad Faisal
    [J]. STRUCTURES, 2023, 50 : 745 - 757
  • [26] Protocol Based on Thromboelastograph (TEG) Out-Performs Physician Preference Using Laboratory Coagulation Tests to Guide Blood Replacement During and After Cardiac Surgery: A Pilot Study
    Westbrook, Andrew J.
    Olsen, Jodi
    Bailey, Michael
    Bates, John
    Scully, Michael
    Salamonsen, Robert F.
    [J]. HEART LUNG AND CIRCULATION, 2009, 18 (04): : 277 - 288
  • [27] Quantification of Primary Tumor-Associated CD3+Lymphocytes Out-Performs Mismatch Repair Deficiency in Predicting Recurrence in Endometrioid-Type Endometrial Carcinoma
    Avila, Monica
    Fellman, Bryan
    Crumley, Suzanne
    Hudgens, Courtney
    Tetzlaff, Michael
    Broaddus, Russell
    [J]. MODERN PATHOLOGY, 2020, 33 (SUPPL 2) : 1014 - 1015
  • [28] Predicting Returns Out of Sample: A Naive Model Averaging Approach
    Chen, Huafeng
    Jiang, Liang
    Liu, Weiwei
    [J]. REVIEW OF ASSET PRICING STUDIES, 2023, 13 (03): : 579 - 614
  • [29] In Children and Youth with Mild and Moderate Traumatic Brain Injury, Glial Fibrillary Acidic Protein Out-Performs S100b in Detecting Traumatic Intracranial Lesions on Computed Tomography
    Papa, Linda
    Mittal, Manoj K.
    Ramirez, Jose
    Ramia, Michelle
    Kirby, Sara
    Silvestri, Salvatore
    Giordano, Philip
    Weber, Kurt
    Braga, Carolina F.
    Tan, Ciara N.
    Ameli, Neema J.
    Lopez, Marco
    Zonfrillo, Mark
    [J]. JOURNAL OF NEUROTRAUMA, 2016, 33 (01) : 58 - 64
  • [30] THE USE OF BOOTSTRAP MODEL AVERAGING WHEN ESTIMATING SURVIVAL CURVES
    Parker, C.
    Hawkins, N.
    [J]. VALUE IN HEALTH, 2015, 18 (03) : A23 - A23