Research and Application on Ensemble Learning Methods

被引:0
|
作者
Wang, Yuzhong [1 ]
机构
[1] Tianjin Univ Technol, Zhonghuan Informat Coll, Tianjin, Peoples R China
关键词
Dataset; Data Preprocessing; Ensemble learning; Data mining; Classification Models;
D O I
10.1007/978-981-32-9050-1_17
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As shown in previous data, diabetes has led to the increasing mortality and considerable financial expenditure in the US. It is necessary to find out how to making correct diagnosis and prescription of diabetes plays an important role in helping patients. That is why we choose the dataset of diabetic inpatients having diagnosis at hospitals in the US, and predict how different treatments and medications influence patient outcomes. We use the class attribute of readmission number to obtain the results. Because of the large and biased dataset, we firstly remove attributes with high missing value rate, and reduce the imbalance classes of instances by over-sampling and under-sampling, then followed by the attribute selection through various methods, such as the Correlation-based feature selection, the Chi-Squared Attribute Evaluator, the Information Gain Attribute Evaluator, etc. Three classification methods C4.5, RIPPER, and Random Forests are used to predict the classification in Weka. In addition, we also use the ensemble learning methods including bagging and boosting to improve the stability and accuracy. From the analysing results, we can see that C4.5 and Ripper perform better, and both bagging and boosting increase the accuracy rate to differing degrees because both algorithms are somewhat unstable. There is no doubt that Random Forests is the best performer among all classification methods we use, and after using boosting, we see big increases in the values of the evaluation metrics we use. The final outcome is much better than random guess.
引用
收藏
页码:145 / 155
页数:11
相关论文
共 50 条
  • [1] Application Research of Ensemble Learning Frameworks
    Wang, Kunkun
    Liu, Xianda
    Zhao, Jianming
    Gao, Hongwei
    Zhang, Zhen
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5767 - 5772
  • [2] Application Research of Ensemble Learning Algorithm in Image Annotation
    Lin, Zhenxiang
    Guo, Jinlin
    Lao, Songyang
    [J]. PROCEEDINGS OF THE 2017 5TH INTERNATIONAL CONFERENCE ON FRONTIERS OF MANUFACTURING SCIENCE AND MEASURING TECHNOLOGY (FMSMT 2017), 2017, 130 : 131 - 134
  • [3] Research on Intrusion Detection Model Using Ensemble learning Methods
    Wang, Ying
    Shen, Yongjun
    Zhang, Guidong
    [J]. PROCEEDINGS OF 2016 IEEE 7TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2016), 2016, : 422 - 425
  • [4] Application of Biologically Inspired Methods to Improve Adaptive Ensemble Learning
    Grmanova, Gabriela
    Rozinajova, Viera
    Ezzedine, Anna Bou
    Lucka, Maria
    Lacko, Peter
    Loderer, Marek
    Vrablecova, Petra
    Laurinec, Peter
    [J]. ADVANCES IN NATURE AND BIOLOGICALLY INSPIRED COMPUTING, 2016, 419 : 235 - 246
  • [5] Research on Ensemble Learning
    Huang, Faliang
    Xie, Guoqing
    Xiao, Ruliang
    [J]. 2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL III, PROCEEDINGS, 2009, : 249 - 252
  • [6] Ensemble methods in machine learning
    Dietterich, TG
    [J]. MULTIPLE CLASSIFIER SYSTEMS, 2000, 1857 : 1 - 15
  • [7] Research on multi-model ensemble machine learning methods for temperature forecasting
    Beijing Meteorological Service Centre, Beijing, China
    [J]. Proc. - Int. Conf. Comput., Inf. Process. Adv. Educ., CIPAE, (428-433):
  • [8] Application of Machine Learning Methods in Nursing Home Research
    Lee, Soo-Kyoung
    Ahn, Jinhyun
    Shin, Juh Hyun
    Lee, Ji Yeon
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2020, 17 (17) : 1 - 15
  • [9] Research of Federated Learning Application Methods and Social Responsibility
    Yang S.
    Zheng W.
    Xie M.
    Zhang X.
    [J]. IEEE Transactions on Big Data, 2024, 10 (06): : 1 - 12
  • [10] Application research on quantitative prediction of TCM syndrome differentiation based on ensemble learning
    Liang, Huaixin
    Yang, Xin
    Li, Shaoxiong
    Chen, Siheng
    Zhang, Xiaoqing
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2020, 64 (01) : 46 - 56