Anomaly detection of diabetes data based on hierarchical clustering and CNN

被引:6
|
作者
Fang, Jinhai [1 ]
Xie, Zuoling [2 ]
Cheng, Haitao [1 ]
Fan, Bin [1 ]
Xu, He [1 ]
Li, Peng [1 ,3 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China
[2] Zhongda Hosp Southeast Univ, Nanjing 210009, Peoples R China
[3] Inst Network Secur & Trusted Comp, Nanjing 210023, Peoples R China
基金
国家重点研发计划;
关键词
Convolutional neural network; Diabetes data; Hierarchical clustering; Anomaly detection; SVM; OUTLIERS;
D O I
10.1016/j.procs.2022.01.010
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a diabetes data anomaly detection approach based on hierarchical clustering and support vector machine (SVM), named hierarchical support vector machine (HCSVM). In the HCSVM approach, the diabetes data sets with the same data characteristics are classified by clustering algorithm, and the data are divided into significant abnormal parts and potential abnormal parts. Additionally, the convolutional neural network (CNN) is utilized to detect and analyze the data of each part. The feature vector output from CNN full connection layer is applied as the input data of SVM classifier, and the optimal classification hyperplane is constructed in high-dimensional space for classification, so as to detect the abnormal data in diabetes data more pertinently. Finally, a real diabetes data set collected by a hospital is used for experiment, and ROC curve is adopted to evaluate the performance of the proposed approach compared with random forest algorithm, KNN algorithm and SVM algorithm. The results show that the HCSVM algorithm combined with hierarchical clustering and SVM can achieve a better performance. (C) 2021 The Authors. Published by Elsevier B.V.
引用
收藏
页码:71 / 78
页数:8
相关论文
共 50 条
  • [1] Network Anomaly Detection Based on Dynamic Hierarchical Clustering of Cross Domain Data
    Liu, Yang
    Xu, Hongping
    Yi, Hang
    Lin, Zhen
    Kang, Jian
    Xia, Weiqiang
    Shi, Qingping
    Liao, Youping
    Ying, Yulong
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C), 2017, : 200 - 204
  • [2] An efficient parallel anomaly detection algorithm based on hierarchical clustering
    Wei-Wu, Ren
    Liang, Hu
    Kuo, Zhao
    Jianfeng, Chu
    [J]. Journal of Networks, 2013, 8 (03) : 672 - 679
  • [3] An improved agglomerative hierarchical clustering anomaly detection method for scientific data
    Shi, Peng
    Zhao, Zhen
    Zhong, Huaqiang
    Shen, Hangyu
    Ding, Lianhong
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (06):
  • [4] Anomaly detection model based on data stream clustering
    Yin, Chunyong
    Zhang, Sun
    Yin, Zhichao
    Wang, Jin
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 1): : 1729 - 1738
  • [5] Anomaly detection model based on data stream clustering
    Chunyong Yin
    Sun Zhang
    Zhichao Yin
    Jin Wang
    [J]. Cluster Computing, 2019, 22 : 1729 - 1738
  • [6] Anomaly intrusion detection based on clustering a data stream
    Oh, Sang-Hyun
    Kang, Jin-Suk
    Bytin, Yung-Cheol
    Jeong, Taikyeong T.
    Lee, Won-Suk
    [J]. INFORMATION SECURITY, PROCEEDINGS, 2006, 4176 : 415 - 426
  • [7] Anomaly Based Intrusion Detection System Using Hierarchical Classification and Clustering Techniques
    Bahjat, Hala
    Mohammed, Suhaila N.
    Ahmed, Wafaa
    Hamad, Sumaya
    Mohammed, Shayma
    [J]. 2020 13TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2020), 2020, : 257 - 262
  • [8] Photovoltaic anomaly data detection method based on clustering iForest
    Han, Bitong
    Shan, Yu
    Xie, Hongbin
    Ge, Leyi
    [J]. THIRD INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION; NETWORK AND COMPUTER TECHNOLOGY (ECNCT 2021), 2022, 12167
  • [9] Volume Traffic Anomaly Detection Using Hierarchical Clustering
    Son, Choonho
    Cho, Seok-Hyung
    Yoo, Jae-Hyoung
    [J]. MANAGEMENT ENABLING THE FUTURE INTERNET FOR CHANGING BUSINESS AND NEW COMPUTING SERVICES, PROCEEDINGS, 2009, 5787 : 291 - 300
  • [10] Anomaly Detection Using Agglomerative Hierarchical Clustering Algorithm
    Mazarbhuiya, Fokrul Alom
    AlZahrani, Mohammed Y.
    Georgieva, Lilia
    [J]. INFORMATION SCIENCE AND APPLICATIONS 2018, ICISA 2018, 2019, 514 : 475 - 484