Multi-stage Mixed Attribute Outlier Detection Algorithm Based on Neighborhood Density Difference

被引:0
|
作者
Du, Haizhou [1 ]
Fang, Wei [1 ]
Liu, Qing [1 ]
Yang, Zhenchen [2 ]
Wang, Xiaofeng [3 ]
机构
[1] Shanghai Univ Elect Power, Sch Comp Sci & Technol, Shanghai, Peoples R China
[2] Shanghai Elect Power Xinda New Energy Technol Co, Shanghai, Peoples R China
[3] State Grid Hangzhou Xiaoshan Power Supply Co, Hangzhou, Peoples R China
关键词
Outlier detection; Mixed attribute; Neighborhood density difference;
D O I
10.1109/BIGCOM.2019.00031
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of big data, an increasing number of mixed attribute dataset has become ubiquitous. It is also extremely important for decision analysis in the processing of a mixed attribute dataset. The existing outlier detection algorithm does not process a mixed attribute dataset and some special objects, because of too high computational time complexity and unsatisfactory detection results. In this paper, we present a multi-stage mixed attribute outlier detection algorithm. Firstly, with data set being divided, the neighborhood information was constructed based on the heterogeneous similarity metric to generate the core point. Then, the primitive clusters can be formed on the basis of the definition. Finally, a neighborhood density difference metric-based outlier detection algorithm was designed to construct neighborhood outlier factor (NOF). Extensive experimental results show the advantages of the proposed method, which could improve the outlier detection accuracy and reduce the time complexity on mixed attributed.
引用
收藏
页码:160 / 168
页数:9
相关论文
共 50 条
  • [11] ODRA: an outlier detection algorithm based on relevant attribute analysis method
    Wahid, Abdul
    Rao, Annavarapu Chandra Sekhara
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (01): : 569 - 585
  • [12] RDOF: An outlier detection algorithm based on relative density
    Wahid, Abdul
    Rao, Annavarapu Chandra Sekhara
    EXPERT SYSTEMS, 2022, 39 (02)
  • [13] Density-based trajectory outlier detection algorithm
    Zhipeng Liu
    Dechang Pi
    Jinfeng Jiang
    JournalofSystemsEngineeringandElectronics, 2013, 24 (02) : 335 - 340
  • [14] An outlier detection algorithm based on local density feedback
    Zhang, Zhongping
    Hou, Yuehan
    Jia, Yin
    Zhang, Ruibo
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, : 3599 - 3629
  • [15] Relative Density-Based Outlier Detection Algorithm
    Ning, Jin
    Chen, Leiting
    Chen, Junwei
    PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 227 - 231
  • [16] Density-based trajectory outlier detection algorithm
    Liu, Zhipeng
    Pi, Dechang
    Jiang, Jinfeng
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2013, 24 (02) : 335 - 340
  • [17] An Outlier Detection Algorithm Based on Probability Density Clustering
    Wang, Wei
    Ren, Yongjian
    Zhou, Renjie
    Zhang, Jilin
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2023, 19 (01) : 22 - 22
  • [18] Outlier detection algorithm based on fast density peak clustering outlier factor
    Zhang, Zhongping
    Li, Sen
    Liu, Weixiong
    Liu, Shuxia
    Tongxin Xuebao/Journal on Communications, 2022, 43 (10): : 186 - 195
  • [19] Attribute reduction with fuzzy rough set based on multiobjective neighborhood difference algorithm
    Li B.-Y.
    Xiao J.-M.
    Wang X.-H.
    Kongzhi yu Juece/Control and Decision, 2019, 34 (05): : 947 - 955
  • [20] Fault monitoring of batch process based on multi-stage optimization regularized neighborhood preserving embedding algorithm
    Zhao, Xiaoqiang
    Liu, Kai
    Hui, Yonyong
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2023, 45 (01) : 89 - 103