Diversity-Based Random Forests with Sample Weight Learning

被引:5
|
作者
Yang, Chun [1 ,2 ]
Yin, Xu-Cheng [1 ,3 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Dept Comp Sci & Technol, Beijing 100083, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing Key Lab Mat Sci Knowledge Engn, Beijing 100083, Peoples R China
[3] Univ Sci & Technol Beijing, Inst Artificial Intelligence, Beijing 100083, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金; 中国博士后科学基金;
关键词
Diversity-based ensembles; Classifier ensemble; Random forests; Sample weight learning; Convex quadratic programming; CLASSIFIER ENSEMBLES; NEURAL-NETWORKS; SPARSITY;
D O I
10.1007/s12559-019-09652-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given a variety of classifiers, one prevalent approach in classifier ensemble is to diversely combine classifier components, i.e., diversity-based ensembles, and a lot of previous works show that these ensembles can improve classification accuracy. Random forests are one of the most important ensembles. However, most random forests approaches with diversity-related aspects focus on maximizing tree diversity while producing and training component trees. Alternatively, a novel cognitive-inspired diversity-based random forests method, diversity-based random forests via sample weight learning (DRFS), is proposed. Given numerous component trees from the original random forests, DRFS selects and combines tree classifiers adaptively via diversity learning and sample weight learning. By designing a matrix for the data distribution creatively, a unified optimization model is formulated to learn and select diverse trees, where tree weights are learned through a convex quadratic programming problem with sample weights. Moreover, a self-training algorithm is proposed to solve the convex optimization iteratively and learn sample weights automatically. Comparative experiments on 39 typical UCI classification benchmarks and a variety of real-world text categorization benchmarks of our proposed method are conducted. Extensive experiments show that our method outperforms the traditional methods. Our proposed DRFS method can select and combine tree classifiers adaptively and improves the performance on a variety of classification tasks.
引用
收藏
页码:685 / 696
页数:12
相关论文
共 50 条
  • [1] Diversity-Based Random Forests with Sample Weight Learning
    Chun Yang
    Xu-Cheng Yin
    [J]. Cognitive Computation, 2019, 11 : 685 - 696
  • [2] Diversity-Based Ensemble with Sample Weight Learning
    Yang, Chun
    Yin, Xu-Cheng
    Hao, Hong-Wei
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1236 - 1241
  • [3] Diversity-based interactive learning meets multimodality
    Calumby, Rodrigo Tripodi
    Goncalves, Marcos Andre
    Torres, Ricardo da Silva
    [J]. NEUROCOMPUTING, 2017, 259 : 159 - 175
  • [4] Diversity-Based Media Search
    Aarabi, Parham
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (IEEE ICMEW 2022), 2022,
  • [5] Diversity-Based Boosting Algorithm
    Alzubi, Jafar A.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (05) : 524 - 529
  • [6] Diversity-based reasoning in children
    Heit, E
    Hahn, U
    [J]. COGNITIVE PSYCHOLOGY, 2001, 43 (04) : 243 - 273
  • [7] Active Learning for Ordinal Classification Based on Adaptive Diversity-Based Uncertainty Sampling
    He, Deniu
    [J]. IEEE ACCESS, 2023, 11 : 16396 - 16410
  • [8] Combining diversity-based active learning with discriminant analysis in image retrieval
    Dagli, CK
    Rajaram, S
    Huang, TS
    [J]. Third International Conference on Information Technology and Applications, Vol 1, Proceedings, 2005, : 173 - 178
  • [9] On the diversity-based measures of equalness and evenness
    Gregorius, Hans-Rolf
    Gillet, Elizabeth M.
    [J]. METHODS IN ECOLOGY AND EVOLUTION, 2024, 15 (03): : 583 - 589
  • [10] Diversity-based case base maintenance
    Yang, LZ
    Ha, MH
    Wang, XZ
    [J]. 2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 1591 - 1596