Joint imbalanced classification and feature selection for hospital readmissions

被引:68
|
作者
Du, Guodong [1 ]
Zhang, Jia [1 ]
Luo, Zhiming [2 ]
Ma, Fenglong [3 ]
Ma, Lei [4 ]
Li, Shaozi [1 ]
机构
[1] Xiamen Univ, Dept Artificial Intelligence, Xiamen 361005, Peoples R China
[2] Xiamen Univ, Postdoc Ctr Informat & Commun Engn, Xiamen 361005, Peoples R China
[3] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA
[4] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China
基金
中国博士后科学基金;
关键词
Hospital readmission; Imbalanced classification; Feature selection; l(1)-norm regularization; Convex optimization; RISK; OPTIMIZATION; PREDICTION; FRAMEWORK; MODELS; SMOTE;
D O I
10.1016/j.knosys.2020.106020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hospital readmission is one of the most important service quality measures. Recently, numerous risk assessment models have been proposed to address the hospital readmission problem. However, poor understanding of the class-imbalance hospital readmission data still challenges the development of accurate predictive models. To overcome the issue, a new risk prediction method termed joint imbalanced classification and feature selection (JICFS) is proposed for handling such a problem. To be specific, we construct the loss function within the large margin framework, in which the sample weight is involved to deal with the class imbalanced problem. Based on this, we design an optimization objective function involving l(1)-norm regularization for improving the performance, and an iterative scheme is proposed to solve the optimization problem, thereby achieving feature selection to improve the performance. Finally, experimental results on six real-world hospital readmission datasets demonstrate that the proposed algorithm has the advantage compared with some state-of-the-art methods. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Novel Approach to Predict Hospital Readmissions Using Feature Selection from Unstructured Data with Class Imbalance
    Sundararaman, Arun
    Ramanathan, Srinivasan Valady
    Thati, Ramprasad
    BIG DATA RESEARCH, 2018, 13 : 65 - 75
  • [42] Addressing Overlapping in Classification with Imbalanced Datasets: A First Multi-objective Approach for Feature and Instance Selection
    Fernandez, Alberto
    Jose del Jesus, Maria
    Herrera, Francisco
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2015, 2015, 9375 : 36 - 44
  • [43] A weighted pattern matching approach for classification of imbalanced data with a fireworks-based algorithm for feature selection
    Sreeja, N. K.
    CONNECTION SCIENCE, 2019, 31 (02) : 143 - 168
  • [44] Dealing with high-dimensional class-imbalanced datasets: Embedded feature selection for SVM classification
    Maldonado, Sebastian
    Lopez, Julio
    APPLIED SOFT COMPUTING, 2018, 67 : 94 - 105
  • [45] A Hybrid Feature Selection with Ensemble Classification for Imbalanced Healthcare Data: A Case Study for Brain Tumor Diagnosis
    Huda S.
    Yearwood J.
    Jelinek H.F.
    Hassan M.M.
    Fortino G.
    Buckland M.
    IEEE Access, 2016, 4 : 9145 - 9154
  • [46] Feature Selection with Imbalanced Data for Software Defect Prediction
    Khoshgoftaar, Taghi M.
    Gao, Kehan
    EIGHTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2009, : 235 - +
  • [47] Feature Selection Algorithm for Class Imbalanced Internet Traffic
    Tang Hong
    Liu Dan
    Yao LiShuang
    Wang Yunfeng
    Pei Zuofei
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (04) : 923 - 930
  • [48] A Tabular Variational Auto Encoder-Based Hybrid Model for Imbalanced Data Classification With Feature Selection
    Abraham, Asha
    Mohideen, Habeeb Shaik
    Kayalvizhi, R.
    IEEE ACCESS, 2023, 11 : 122760 - 122771
  • [49] A Hybrid Feature Selection With Ensemble Classification for Imbalanced Healthcare Data A Case Study for Brain Tumor Diagnosis
    Huda, Shamsul
    Yearwood, John
    Jelinek, Herbert F.
    Hassan, Mohammad Mehedi
    Fortino, Giancorlo
    Buckland, Michael
    IEEE ACCESS, 2016, 4 : 9145 - 9154
  • [50] Feature selection for high-dimensional imbalanced data
    Yin, Liuzhi
    Ge, Yong
    Xiao, Keli
    Wang, Xuehua
    Quan, Xiaojun
    NEUROCOMPUTING, 2013, 105 : 3 - 11