A Variable Markovian based Outlier Detection Method for Multi-dimensional Sequence over Data Stream

被引:0
|
作者
Yang, Dongsheng [1 ]
Wang, Yijie [1 ]
Li, Yongmou [1 ]
Ma, Xingkong [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Sci & Technol Parallel & Distributed Proc Lab, Changsha 410073, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-dimensional sequence; data stream; outlier detection; feature selection; mutual information; variable Markovian; QUERIES;
D O I
10.1109/PDCAT.2016.48
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays sequence data tends to be multidimensional sequence over data stream, it has a large state space and arrives at unprecedented speed. It is a big challenge to design a multi-dimensional sequence outlier detection method to meet the accurate and high speed requirements. The traditional methods can't handle multi-dimensional sequence effectively as they have poor abilities for multi-dimensional sequence modeling, and can't detect outlier timely as they have high computational complexity. In this paper we propose a variable Markovian based outlier detection method for multi-dimensional sequence over data stream, VMOD, which consists of two algorithms: mutual information based feature selection algorithm (MIFS), variable Markovian based sequential analysis algorithm (VMSA). It uses MIFS algorithm to reduce the state space and redundant features, and uses VMSA algorithm to accelerate the outlier detection. Through VMOD method, we can improve the detection rate and detection speed. The MIFS algorithm uses mutual information as similarity measures and adopt clustering based strategy to select features, it can improve the abilities for sequence modeling through reducing the state space and redundant features, consequently, to improve the detection rate. The VMSA algorithm use random sample and index structure to accelerate the variable Markovian model construction and reduce the model complexity, consequently, to quicken the outlier detection. The experiments show that VMOD can detect outlier effectively, and reduce the detection time by at least 50% compared with the traditional methods.
引用
收藏
页码:183 / 188
页数:6
相关论文
共 50 条
  • [1] A C-SVM based Anomaly Detection Method for Multi-dimensional Sequence over Data Stream
    Bao, Han
    Wang, Yijie
    [J]. 2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2016, : 948 - 955
  • [2] FAAD: an unsupervised fast and accurate anomaly detection method for a multi-dimensional sequence over data stream
    Bin Li
    Yi-jie Wang
    Dong-sheng Yang
    Yong-mou Li
    Xing-kong Ma
    [J]. Frontiers of Information Technology & Electronic Engineering, 2019, 20 : 388 - 404
  • [3] FAAD: an unsupervised fast and accurate anomaly detection method for a multi-dimensional sequence over data stream
    Li, Bin
    Wang, Yi-jie
    Yang, Dong-sheng
    Li, Yong-mou
    Ma, Xing-kong
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2019, 20 (03) : 388 - 404
  • [4] FAAD:an unsupervised fast and accurate anomaly detection method for a multi-dimensional sequence over data stream
    Bin LI
    Yi-jie WANG
    Dong-sheng YANG
    Yong-mou LI
    Xing-kong MA
    [J]. Frontiers of Information Technology & Electronic Engineering, 2019, 20 (03) : 388 - 404
  • [5] Outlier detection based on multi-dimensional clustering and local density
    Shou Zhao-yu
    Li Meng-ya
    Li Si-min
    [J]. JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2017, 24 (06) : 1299 - 1306
  • [6] Outlier detection based on multi-dimensional clustering and local density
    首照宇
    李萌芽
    李思敏
    [J]. Journal of Central South University, 2017, 24 (06) : 1299 - 1306
  • [7] Density-based Outlier Detection in Multi-dimensional Datasets
    Wang, Xite
    Cao, Zhixin
    Zhan, Rongjuan
    Bai, Mei
    Ma, Qian
    Li, Guanyu
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2022, 16 (12): : 3815 - 3835
  • [8] Outlier detection based on multi-dimensional clustering and local density
    Zhao-yu Shou
    Meng-ya Li
    Si-min Li
    [J]. Journal of Central South University, 2017, 24 : 1299 - 1306
  • [9] Outlier Detection for Robust Multi-Dimensional Scaling
    Blouvshtein, Leonid
    Cohen-Or, Daniel
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (09) : 2273 - 2279
  • [10] COID: A cluster–outlier iterative detection approach to multi-dimensional data analysis
    Yong Shi
    Li Zhang
    [J]. Knowledge and Information Systems, 2011, 28 : 709 - 733