A Variable Markovian based Outlier Detection Method for Multi-dimensional Sequence over Data Stream

Cited by: 0
Authors
Yang, Dongsheng [1 ]
Wang, Yijie [1 ]
Li, Yongmou [1 ]
Ma, Xingkong [1 ]
Affiliation
[1] Natl Univ Def Technol, Coll Comp, Sci & Technol Parallel & Distributed Proc Lab, Changsha 410073, Hunan, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
multi-dimensional sequence; data stream; outlier detection; feature selection; mutual information; variable Markovian; QUERIES;
DOI
10.1109/PDCAT.2016.48
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
Sequence data today increasingly takes the form of multi-dimensional sequences over data streams, which have a large state space and arrive at unprecedented speed. Designing an outlier detection method for multi-dimensional sequences that is both accurate and fast is a major challenge. Traditional methods cannot handle multi-dimensional sequences effectively because they model such sequences poorly, and they cannot detect outliers in a timely manner because of their high computational complexity. In this paper we propose VMOD, a variable Markovian based outlier detection method for multi-dimensional sequences over data streams. VMOD consists of two algorithms: a mutual information based feature selection algorithm (MIFS) and a variable Markovian based sequential analysis algorithm (VMSA). MIFS reduces the state space and removes redundant features, and VMSA accelerates outlier detection; together they improve both the detection rate and the detection speed. MIFS uses mutual information as the similarity measure and adopts a clustering-based strategy to select features. By reducing the state space and the number of redundant features, it improves sequence modeling ability and, consequently, the detection rate. VMSA uses random sampling and an index structure to accelerate construction of the variable Markovian model and to reduce model complexity, which in turn speeds up outlier detection. Experiments show that VMOD detects outliers effectively and reduces detection time by at least 50% compared with traditional methods.
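The abstract does not give the details of MIFS, so the following is only a minimal sketch of the general idea it names: using mutual information as a similarity measure between features and grouping redundant features so that one representative per group is kept. The function names, the normalization by the smaller entropy, the greedy grouping rule, and the `threshold` value are all assumptions made for illustration, not the authors' algorithm.

```python
from collections import Counter
from math import log2


def entropy(xs):
    """Empirical Shannon entropy (bits) of a discrete sequence."""
    n = len(xs)
    return -sum((c / n) * log2(c / n) for c in Counter(xs).values())


def mutual_information(xs, ys):
    """Empirical mutual information (bits) between two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * log2(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


def select_features(columns, threshold=0.8):
    """Greedy grouping (assumed rule): a feature is treated as redundant if
    its normalized MI with an already kept feature reaches `threshold`;
    otherwise it is kept as the representative of a new group."""
    kept = []
    for name, values in columns.items():
        redundant = False
        for rep in kept:
            denom = min(entropy(values), entropy(columns[rep]))
            # MI never exceeds the smaller entropy, so this ratio is in [0, 1].
            nmi = mutual_information(values, columns[rep]) / denom if denom > 0 else 0.0
            if nmi >= threshold:
                redundant = True
                break
        if not redundant:
            kept.append(name)
    return kept


# Hypothetical toy data: "b" is the bitwise inverse of "a" (fully redundant),
# while "c" is statistically independent of "a" in this sample.
columns = {
    "a": [0, 0, 1, 1, 0, 1, 0, 1],
    "b": [1, 1, 0, 0, 1, 0, 1, 0],
    "c": [0, 1, 0, 1, 0, 1, 1, 0],
}
selected = select_features(columns)
```

On this toy input, "b" shares all of its information with "a" and is dropped, while "c" shares none and is kept. Fewer retained features means a smaller joint state space for any downstream sequence model, which is the effect the abstract attributes to feature selection.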
Pages: 183-188
Number of pages: 6