Confusion-Matrix-Based Kernel Logistic Regression for Imbalanced Data Classification

Cited by: 112
Authors
Ohsaki, Miho [1]
Wang, Peng [1]
Matsuda, Kenji [1]
Katagiri, Shigeru [1]
Watanabe, Hideyuki [2]
Ralescu, Anca [3]
Affiliations
[1] Doshisha Univ, Grad Sch Sci & Engn, 1-3 Tataramiyakodani, Kyotanabe, Kyoto 6100321, Japan
[2] Natl Inst Informat & Commun Technol, 3-5 Hikaridai, Seika, Kyoto 6190289, Japan
[3] Univ Cincinnati, Coll Engn & Appl Sci, Dept Elect Engn & Comp Syst, 812 Rhodes Hall, Cincinnati, OH 45221 USA
Keywords
Imbalanced data; confusion matrix; kernel logistic regression; minimum classification error and generalized probabilistic descent; OPTIMIZATION; SELECTION; MODEL;
DOI
10.1109/TKDE.2017.2682249
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
There have been many attempts to classify imbalanced data, since this classification is critical in a wide variety of applications related to the detection of anomalies, failures, and risks. Many conventional methods, which can be categorized as sampling, cost-sensitive, or ensemble approaches, include heuristic and task-dependent processes. To achieve better classification performance through a formulation free of heuristics and task dependence, we propose confusion-matrix-based kernel logistic regression (CM-KLOGR). Its objective function is the harmonic mean of various evaluation criteria derived from a confusion matrix, such as sensitivity, positive predictive value, and the corresponding criteria for negatives. This objective function and its optimization are consistently formulated within the framework of KLOGR, based on minimum classification error and generalized probabilistic descent (MCE/GPD) learning. Owing to the merits of the harmonic mean, KLOGR, and MCE/GPD, CM-KLOGR improves the multifaceted performances in a well-balanced way. This paper presents the formulation of CM-KLOGR and demonstrates its effectiveness through experiments that comparatively evaluated CM-KLOGR on benchmark imbalanced datasets.
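The harmonic-mean idea in the abstract can be illustrated with a small sketch. This is not the CM-KLOGR training procedure, which optimizes a smoothed objective via MCE/GPD; it only computes the harmonic mean of four confusion-matrix criteria from hard predictions. The choice of specificity and negative predictive value as the "criteria for negatives", and the function names `confusion_counts` and `harmonic_mean_of_criteria`, are assumptions made for illustration.

```python
def confusion_counts(y_true, y_pred):
    """Count TP, FP, TN, FN for binary labels (1 = positive, 0 = negative)."""
    tp = fp = tn = fn = 0
    for t, p in zip(y_true, y_pred):
        if t == 1 and p == 1:
            tp += 1
        elif t == 0 and p == 1:
            fp += 1
        elif t == 0 and p == 0:
            tn += 1
        else:
            fn += 1
    return tp, fp, tn, fn


def _ratio(numerator, denominator):
    # Assumption: treat an undefined criterion (0/0) as 0.0.
    return numerator / denominator if denominator else 0.0


def harmonic_mean_of_criteria(tp, fp, tn, fn):
    """Harmonic mean of sensitivity, specificity, PPV, and NPV."""
    criteria = [
        _ratio(tp, tp + fn),  # sensitivity (recall on positives)
        _ratio(tn, tn + fp),  # specificity (recall on negatives)
        _ratio(tp, tp + fp),  # positive predictive value
        _ratio(tn, tn + fn),  # negative predictive value
    ]
    # The harmonic mean collapses to 0 as soon as any single criterion is 0.
    if any(c == 0.0 for c in criteria):
        return 0.0
    return len(criteria) / sum(1.0 / c for c in criteria)


# Imbalanced toy data: 2 positives, 8 negatives.
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
y_all_negative = [0] * 10                    # always predicts the majority class
y_mixed = [1, 0, 0, 0, 0, 0, 0, 0, 0, 1]     # finds one positive, one false alarm

score_all_negative = harmonic_mean_of_criteria(*confusion_counts(y_true, y_all_negative))
score_mixed = harmonic_mean_of_criteria(*confusion_counts(y_true, y_mixed))
```

In this toy case, the classifier that always predicts the majority class reaches 80% accuracy but has zero sensitivity, so its harmonic-mean score is zero. That is the kind of one-sided performance a harmonic-mean objective penalizes.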
Pages: 1806-1819
Number of pages: 14