Improved Dirichlet mixture model clustering algorithm for medical data anomaly detection

被引:0
|
作者
Wu, Lili [1 ,2 ]
Ali, Majid Khan Majahar [3 ]
Shan, Fam Pei [3 ]
Tian, Ying [4 ]
Tao, Li [3 ]
机构
[1] Xinzhou Teachers Univ, Dept Comp Sci, Xinzhou 034000, Peoples R China
[2] Univ Sains Malaysia USM, Sch Math Sci, George Town 11800, Malaysia
[3] USM, Sch Math Sci, George Town 11800, Malaysia
[4] Taiyuan Univ Technol, Dept Math, Taiyuan 030024, Peoples R China
关键词
over-diagnosis; anomaly expenses; anomaly detection; DPMM; CBLOF;
D O I
10.1504/IJBIC.2024.10064803
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to address the issue of identifying over-diagnosis and anomaly expenses in the healthcare service process, a local outlier mining clustering algorithm (ILOF-DPMM) is proposed by combining the clustering-based local outlier factor (CBLOF) algorithm with Dirichlet mixture model (DPMM). By extracting the patient's hospitalisation records from the medical record homepage, the influencing factors of hospitalisation costs for different disease types are classified, and the random forest method is used to reduce the feature dimension by disease type. The feature extraction and dimensionality reduction methods adopted by this algorithm effectively cluster medical insurance expense data. When calculating the LOF value of data, using a weighted calculation method based on the similarity of discrete and continuous features can more accurately detect abnormal data points in the data set, and has the ability to detect new data in real time, thus improving detection accuracy and efficiency.
引用
收藏
页码:11 / 21
页数:12
相关论文
共 50 条
  • [21] Research on unsupervised anomaly data detection method based on improved automatic encoder and Gaussian mixture model
    Liu, Xiangyu
    Zhu, Shibing
    Yang, Fan
    Liang, Shengjun
    Journal of Cloud Computing, 2022, 11 (01)
  • [22] Outlier detection in traffic data based on the Dirichlet process mixture model
    Ngan, Henry Y. T.
    Yung, Nelson H. C.
    Yeh, Anthony G. O.
    IET INTELLIGENT TRANSPORT SYSTEMS, 2015, 9 (07) : 773 - 781
  • [23] Research on unsupervised anomaly data detection method based on improved automatic encoder and Gaussian mixture model
    Liu, Xiangyu
    Zhu, Shibing
    Yang, Fan
    Liang, Shengjun
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2022, 11 (01):
  • [24] Clustering with label constrained Dirichlet process mixture model
    Burhanuddin, Nurul Afiqah
    Adam, Mohd Bakri
    Ibrahim, Kamarulzaman
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 107
  • [25] Unsupervised nested Dirichlet finite mixture model for clustering
    Fares Alkhawaja
    Nizar Bouguila
    Applied Intelligence, 2023, 53 : 25232 - 25258
  • [26] Graph Clustering Using Dirichlet Process Mixture Model
    Atastina, Imelda
    Sitohang, Benhard
    Putri, G. A. S.
    Moertini, Veronica S.
    PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON DATA AND SOFTWARE ENGINEERING (ICODSE), 2017,
  • [27] Unsupervised nested Dirichlet finite mixture model for clustering
    Alkhawaja, Fares
    Bouguila, Nizar
    APPLIED INTELLIGENCE, 2023, 53 (21) : 25232 - 25258
  • [28] Comparison between EM algorithm and dynamical clustering algorithm for Dirichlet mixture samples
    Xia B.
    Richard E.
    Wang H.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2019, 45 (09): : 1805 - 1811
  • [29] Active Online Anomaly Detection using Dirichlet Process Mixture Model and Gaussian Process Classification
    Varadarajan, Jagannadan
    Subramanian, Ramanathan
    Ahuja, Narendra
    Moulin, Pierre
    Odobez, Jean-Marc
    2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 615 - 623
  • [30] Anomaly Detection using Improved Hierarchy Clustering
    Hu Liang
    Ren Wei-wu
    Ren Fei
    2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL I, PROCEEDINGS, 2009, : 319 - 323