aggregation;
class probability estimation;
random forest;
REGRESSION;
ERROR;
D O I:
10.1002/sam.11446
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
In random forest methodology, an overall prediction or estimate is made by aggregating predictions made by individual decision trees. Popular implementations of random forests rely on different methods for aggregating predictions. In this study, we provide an empirical analysis of the performance of aggregation approaches available for classification and regression problems. We show that while the choice of aggregation scheme usually has little impact in regression, it can have a profound effect on probability estimation in classification problems. Our study illustrates the causes of calibration issues that arise from two popular aggregation approaches and highlights the important role that terminal nodesize plays in the aggregation of tree predictions. We show that optimal choices for random forest tuning parameters depend heavily on the manner in which tree predictions are aggregated.
机构:
China Univ Geosci, Dept Comp Sci, Wuhan 430074, Hubei, Peoples R ChinaChina Univ Geosci, Dept Comp Sci, Wuhan 430074, Hubei, Peoples R China
Jiang, Liangxiao
Cai, Zhihua
论文数: 0引用数: 0
h-index: 0
机构:
China Univ Geosci, Dept Comp Sci, Wuhan 430074, Hubei, Peoples R ChinaChina Univ Geosci, Dept Comp Sci, Wuhan 430074, Hubei, Peoples R China
Cai, Zhihua
Wang, Dianhong
论文数: 0引用数: 0
h-index: 0
机构:
China Univ Geosci, Dept Elect Engn, Wuhan 430074, Hunan, Peoples R ChinaChina Univ Geosci, Dept Comp Sci, Wuhan 430074, Hubei, Peoples R China
Wang, Dianhong
Zhang, Harry
论文数: 0引用数: 0
h-index: 0
机构:
Univ New Brunswick Fredericton, Fac Comp Sci, Fredericton, NB E3B 5A3, CanadaChina Univ Geosci, Dept Comp Sci, Wuhan 430074, Hubei, Peoples R China
机构:
KU, Dept Dev & Regenerat, Leuven, Belgium
KU, Leuven Unit Hlth Technol Assessment Res LUHTAR, Leuven, BelgiumKU, Dept Dev & Regenerat, Leuven, Belgium
Barrenada, Lasai
Dhiman, Paula
论文数: 0引用数: 0
h-index: 0
机构:
Univ Oxford, Nuffield Dept Orthopaed Rheumatol & Musculoskeleta, Ctr Stat Med, Oxford, EnglandKU, Dept Dev & Regenerat, Leuven, Belgium
Dhiman, Paula
Timmerman, Dirk
论文数: 0引用数: 0
h-index: 0
机构:
KU, Dept Dev & Regenerat, Leuven, Belgium
Univ Hosp Leuven, Dept Obstet & Gynecol, Leuven, BelgiumKU, Dept Dev & Regenerat, Leuven, Belgium
机构:
KU, Dept Dev & Regenerat, Leuven, Belgium
KU, Leuven Unit Hlth Technol Assessment Res LUHTAR, Leuven, Belgium
Leiden Univ, Med Ctr, Dept Biomed Data Sci, Leiden, NetherlandsKU, Dept Dev & Regenerat, Leuven, Belgium
机构:
Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R ChinaChinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China
Sheng, Ying
Wang, Qihua
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China
Univ Chinese Acad Sci, Beijing 100049, Peoples R ChinaChinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China
机构:
Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing, Peoples R China
Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R ChinaNanjing Normal Univ, Sch Comp & Elect Informat, Nanjing, Peoples R China
Wu, Xiaotong
论文数: 引用数:
h-index:
机构:
Bilal, Muhammad
Xu, Xiaolong
论文数: 0引用数: 0
h-index: 0
机构:
Nanjing Univ Informat Sci & Technol, Sch Software, Nanjing, Peoples R ChinaNanjing Normal Univ, Sch Comp & Elect Informat, Nanjing, Peoples R China
Xu, Xiaolong
Song, Houbing
论文数: 0引用数: 0
h-index: 0
机构:
Univ Maryland Baltimore Cty UMBC, Dept Informat Syst, Baltimore, MD 21250 USANanjing Normal Univ, Sch Comp & Elect Informat, Nanjing, Peoples R China
机构:
Department of Development and Regeneration,Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal SciencesDepartment of Development and Regeneration,Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences
Lasai Barreñada
Paula Dhiman
论文数: 0引用数: 0
h-index: 0
机构:
Leuven Unit for Health Technology Assessment Research (LUHTAR),Department of Obstetrics and GynecologyDepartment of Development and Regeneration,Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences
Paula Dhiman
Dirk Timmerman
论文数: 0引用数: 0
h-index: 0
机构:
University of Oxford,Institute for Medical Information Processing, Biometry and Epidemiology, Faculty of MedicineDepartment of Development and Regeneration,Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences
Dirk Timmerman
Anne-Laure Boulesteix
论文数: 0引用数: 0
h-index: 0
机构:
Department of Development and Regeneration,Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal SciencesDepartment of Development and Regeneration,Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences
Anne-Laure Boulesteix
Ben Van Calster
论文数: 0引用数: 0
h-index: 0
机构:
University Hospitals Leuven,Department of Biomedical Data SciencesDepartment of Development and Regeneration,Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences