A Variable Markovian based Outlier Detection Method for Multi-dimensional Sequence over Data Stream

被引:0
|
作者
Yang, Dongsheng [1 ]
Wang, Yijie [1 ]
Li, Yongmou [1 ]
Ma, Xingkong [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Sci & Technol Parallel & Distributed Proc Lab, Changsha 410073, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-dimensional sequence; data stream; outlier detection; feature selection; mutual information; variable Markovian; QUERIES;
D O I
10.1109/PDCAT.2016.48
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays sequence data tends to be multidimensional sequence over data stream, it has a large state space and arrives at unprecedented speed. It is a big challenge to design a multi-dimensional sequence outlier detection method to meet the accurate and high speed requirements. The traditional methods can't handle multi-dimensional sequence effectively as they have poor abilities for multi-dimensional sequence modeling, and can't detect outlier timely as they have high computational complexity. In this paper we propose a variable Markovian based outlier detection method for multi-dimensional sequence over data stream, VMOD, which consists of two algorithms: mutual information based feature selection algorithm (MIFS), variable Markovian based sequential analysis algorithm (VMSA). It uses MIFS algorithm to reduce the state space and redundant features, and uses VMSA algorithm to accelerate the outlier detection. Through VMOD method, we can improve the detection rate and detection speed. The MIFS algorithm uses mutual information as similarity measures and adopt clustering based strategy to select features, it can improve the abilities for sequence modeling through reducing the state space and redundant features, consequently, to improve the detection rate. The VMSA algorithm use random sample and index structure to accelerate the variable Markovian model construction and reduce the model complexity, consequently, to quicken the outlier detection. The experiments show that VMOD can detect outlier effectively, and reduce the detection time by at least 50% compared with the traditional methods.
引用
收藏
页码:183 / 188
页数:6
相关论文
共 50 条
  • [21] A Data Stream Outlier Detection Algorithm Based on Grid
    Yu Xiang
    Lei Guohua
    Xu Xiandong
    Lin Liandong
    [J]. 2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 4136 - 4141
  • [22] An Efficient Algorithm for Distributed Outlier Detection in Large Multi-Dimensional Datasets
    Wang, Xi-Te
    Shen, De-Rong
    Bai, Mei
    Nie, Tie-Zheng
    Kou, Yue
    Yu, Ge
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2015, 30 (06) : 1233 - 1248
  • [23] An Efficient Algorithm for Distributed Outlier Detection in Large Multi-Dimensional Datasets
    Xi-Te Wang
    De-Rong Shen
    Mei Bai
    Tie-Zheng Nie
    Yue Kou
    Ge Yu
    [J]. Journal of Computer Science and Technology, 2015, 30 : 1233 - 1248
  • [24] Oui! Outlier Interpretation on Multi-dimensional Data via Visual Analytics
    Zhao, Xun
    Cui, Weiwei
    Wu, Yanhong
    Zhang, Haidong
    Qui, Huamin
    Zhang, Dongmei
    [J]. COMPUTER GRAPHICS FORUM, 2019, 38 (03) : 213 - 224
  • [25] A multi-dimensional wavelet-based anomaly detection method
    Wu, Shuyan
    Li, Xiaoge
    Zhang, Bin
    Qin, Donghong
    [J]. ICIC Express Letters, 2015, 9 (12): : 3393 - 3399
  • [26] Research of an Improved PCA Method for Abnormality Diagnosis in Synchronous Multi-dimensional Data Stream
    Yang, Tongyao
    Wang, Bin
    Li, Chuan
    He, Bi
    [J]. 2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 7797 - 7802
  • [27] A New Anomaly Detection Method Based on Multi-dimensional Condition Monitoring Data for Aircraft Engine
    Chen, Shaowei
    Wu, Meng
    Zhao, Shuai
    Wen, Pengfei
    Huang, Dengshan
    Wang, Yan
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (ICPHM), 2019,
  • [28] Stream cube: An architecture for multi-dimensional analysis of data streams
    Han, JW
    Chen, YX
    Dong, GZ
    Pei, H
    Wah, BW
    Wang, JY
    Cai, YD
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2005, 18 (02) : 173 - 197
  • [29] Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams
    Jiawei Han
    Yixin Chen
    Guozhu Dong
    Jian Pei
    Benjamin W. Wah
    Jianyong Wang
    Y. Dora Cai
    [J]. Distributed and Parallel Databases, 2005, 18 : 173 - 197
  • [30] An Efficient Outlier Detection Approach Over Uncertain Data Stream Based on Frequent Itemset Mining
    Hao, Shangbo
    Cai, Saihua
    Sun, Ruizhi
    Li, Sicong
    [J]. INFORMATION TECHNOLOGY AND CONTROL, 2019, 48 (01): : 34 - 46