Anomaly detection of diabetes data based on hierarchical clustering and CNN

被引:6
|
作者
Fang, Jinhai [1 ]
Xie, Zuoling [2 ]
Cheng, Haitao [1 ]
Fan, Bin [1 ]
Xu, He [1 ]
Li, Peng [1 ,3 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China
[2] Zhongda Hosp Southeast Univ, Nanjing 210009, Peoples R China
[3] Inst Network Secur & Trusted Comp, Nanjing 210023, Peoples R China
基金
国家重点研发计划;
关键词
Convolutional neural network; Diabetes data; Hierarchical clustering; Anomaly detection; SVM; OUTLIERS;
D O I
10.1016/j.procs.2022.01.010
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a diabetes data anomaly detection approach based on hierarchical clustering and support vector machine (SVM), named hierarchical support vector machine (HCSVM). In the HCSVM approach, the diabetes data sets with the same data characteristics are classified by clustering algorithm, and the data are divided into significant abnormal parts and potential abnormal parts. Additionally, the convolutional neural network (CNN) is utilized to detect and analyze the data of each part. The feature vector output from CNN full connection layer is applied as the input data of SVM classifier, and the optimal classification hyperplane is constructed in high-dimensional space for classification, so as to detect the abnormal data in diabetes data more pertinently. Finally, a real diabetes data set collected by a hospital is used for experiment, and ROC curve is adopted to evaluate the performance of the proposed approach compared with random forest algorithm, KNN algorithm and SVM algorithm. The results show that the HCSVM algorithm combined with hierarchical clustering and SVM can achieve a better performance. (C) 2021 The Authors. Published by Elsevier B.V.
引用
收藏
页码:71 / 78
页数:8
相关论文
共 50 条
  • [21] Tabular Data Anomaly Detection Based on Density Peak Clustering Algorithm
    Liang, Dong
    Wang, Jun
    Zhang, Wenping
    Liu, Yuqi
    Wang, Lei
    Zhao, Xiaoyong
    [J]. 2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 16 - 21
  • [22] A CNN Based Approach for Crowd Anomaly Detection
    Joshi, Kinjal, V
    Patel, Narendra M.
    [J]. INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2021, 12 (01): : 1 - 11
  • [23] Data driven battery anomaly detection based on shape based clustering for the data centers class
    Haider, Syed Naeem
    Zhao, Qianchuan
    Li, Xueliang
    [J]. JOURNAL OF ENERGY STORAGE, 2020, 29
  • [24] Clustering and classification based anomaly detection
    Yang, Hongyu
    Xie, Feng
    Lu, Yi
    [J]. FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4223 : 1082 - 1091
  • [25] A NEGATIVE SELECTION ALGORITHM BASED ON HIERARCHICAL CLUSTERING OF SELF SET AND ITS APPLICATION IN ANOMALY DETECTION
    Chen, Wen
    Liu, Xiao-Jie
    Li, Tao
    Shi, Yuan-Quan
    Zheng, Xu-Fei
    Zhao, Hui
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2011, 4 (04) : 410 - 419
  • [26] A Negative Selection Algorithm Based on Hierarchical Clustering of Self set and its Application in Anomaly Detection
    Chen W.
    Liu X.-J.
    Li T.
    Shi Y.-Q.
    Zheng X.-F.
    Zhao H.
    [J]. International Journal of Computational Intelligence Systems, 2011, 4 (4) : 410 - 419
  • [27] Deep clustering hierarchical latent representation for anomaly-based cyber-attack detection
    Nguyen, Van Quan
    Ngo, Long Thanh
    Nguyen, Le Minh
    Nguyen, Viet Hung
    Shone, Nathan
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 301
  • [28] HUAD: Hierarchical Urban Anomaly Detection Based on Spatio-Temporal Data
    Kong, Xiangjie
    Gao, Haoran
    Alfarraj, Osama
    Ni, Qichao
    Zheng, Chaofan
    Shen, Guojiang
    [J]. IEEE ACCESS, 2020, 8 : 26573 - 26582
  • [29] Research on Anomaly Detection of Smart Meters Data Based on Feature Clustering Method
    Chen, Zhiru
    Guo, Liang
    Du, Yan
    Dong, Xianguang
    Wang, Yuxi
    Liu, Ningning
    [J]. 2021 IEEE IAS INDUSTRIAL AND COMMERCIAL POWER SYSTEM ASIA (IEEE I&CPS ASIA 2021), 2021, : 1562 - 1566
  • [30] Anomaly detection for streaming data based on grid-clustering and Gaussian distribution
    Zou, Beiji
    Yang, Kangkang
    Kui, Xiaoyan
    Liu, Jun
    Liao, Shenghui
    Zhao, Wei
    [J]. INFORMATION SCIENCES, 2023, 638