Multi-stage Mixed Attribute Outlier Detection Algorithm Based on Neighborhood Density Difference

被引:0
|
作者
Du, Haizhou [1 ]
Fang, Wei [1 ]
Liu, Qing [1 ]
Yang, Zhenchen [2 ]
Wang, Xiaofeng [3 ]
机构
[1] Shanghai Univ Elect Power, Sch Comp Sci & Technol, Shanghai, Peoples R China
[2] Shanghai Elect Power Xinda New Energy Technol Co, Shanghai, Peoples R China
[3] State Grid Hangzhou Xiaoshan Power Supply Co, Hangzhou, Peoples R China
关键词
Outlier detection; Mixed attribute; Neighborhood density difference;
D O I
10.1109/BIGCOM.2019.00031
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of big data, an increasing number of mixed attribute dataset has become ubiquitous. It is also extremely important for decision analysis in the processing of a mixed attribute dataset. The existing outlier detection algorithm does not process a mixed attribute dataset and some special objects, because of too high computational time complexity and unsatisfactory detection results. In this paper, we present a multi-stage mixed attribute outlier detection algorithm. Firstly, with data set being divided, the neighborhood information was constructed based on the heterogeneous similarity metric to generate the core point. Then, the primitive clusters can be formed on the basis of the definition. Finally, a neighborhood density difference metric-based outlier detection algorithm was designed to construct neighborhood outlier factor (NOF). Extensive experimental results show the advantages of the proposed method, which could improve the outlier detection accuracy and reduce the time complexity on mixed attributed.
引用
收藏
页码:160 / 168
页数:9
相关论文
共 50 条
  • [1] Multigranulation Relative Entropy-Based Mixed Attribute Outlier Detection in Neighborhood Systems
    Yuan, Zhong
    Chen, Hongmei
    Li, Tianrui
    Zhang, Xianyong
    Sang, Binbin
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (08): : 5175 - 5187
  • [2] Density-Distance Outlier Detection Algorithm Based on Natural Neighborhood
    Zhang, Jiaxuan
    Yang, Youlong
    AXIOMS, 2023, 12 (05)
  • [3] Data outlier detection algorithm based on density difference of double radius
    Department of Computer Science and Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China
    不详
    不详
    Gaojishu Tongxin/Chinese High Technology Letters, 2008, 18 (04): : 350 - 354
  • [4] Complex multi-stage decision making method based on mixed multi-attribute information
    Xu, Xuan-Hua
    Cai, Chen-Guang
    Liang, Dong
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2015, 37 (10): : 2315 - 2321
  • [5] Outlier Detection of Mixed Data Based on Neighborhood Combinatorial Entropy
    Wang, Lina
    Zhang, Qixiang
    Niu, Xiling
    Ren, Yongjun
    Xia, Jinyue
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 69 (02): : 1765 - 1781
  • [6] Outlier detection using neighborhood rank difference
    Bhattacharya, Gautam
    Ghosh, Koushik
    Chowdhury, Ananda S.
    PATTERN RECOGNITION LETTERS, 2015, 60-61 : 24 - 31
  • [7] MEOD: A Robust Multi-stage Ensemble Model Based on Rank Aggregation and Stacking for Outlier Detection
    Jiang, Zhengchao
    Zhang, Fan
    Xu, Hao
    Tao, Li
    Zhang, Zili
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2022, PT III, 2022, 13370 : 205 - 218
  • [8] An Effective Pattern Based Outlier Detection Approach for Mixed Attribute Data
    Zhang, Ke
    Jin, Huidong
    AI 2010: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2010, 6464 : 122 - 131
  • [9] Outlier Detection Based on Fuzzy Rough Granules in Mixed Attribute Data
    Yuan, Zhong
    Chen, Hongmei
    Li, Tianrui
    Sang, Binbin
    Wang, Shu
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (08) : 8399 - 8412
  • [10] ODRA: an outlier detection algorithm based on relevant attribute analysis method
    Abdul Wahid
    Annavarapu Chandra Sekhara Rao
    Cluster Computing, 2021, 24 : 569 - 585