Support high-order tensor data description for outlier detection in high-dimensional big sensor data

Cited by: 15
Authors
Deng, Xiaowu [1, 2, 3]
Jiang, Peng [1]
Peng, Xiaoning [2, 3]
Mi, Chunqiao [2, 3]
Affiliations
[1] Hangzhou Dianzi Univ, Coll Automat, Hangzhou 310018, Zhejiang, Peoples R China
[2] Huaihua Univ, Sch Comp Sci & Engn, Huaihua 418000, Peoples R China
[3] Hunan Prov Key Lab Ecol Agr Intelligent Control T, Huaihua 418000, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
Big sensor data; High-dimensional data; Outlier detection; CP factorization; KSTDD; MODELS;
DOI
10.1016/j.future.2017.10.013
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Various kinds of high-dimensional sensor data can be collected by wireless sensor networks, video monitoring systems and multimedia sensor networks. High-dimensional sensor data is inherently large-scale because each sensor node has spatial attributes and may also be associated with large amounts of measurement data evolving over time. Detecting outliers in high-dimensional big sensor data is a challenging task. Most existing outlier detection methods are based on vector representations. However, high-dimensional sensor data is naturally described by tensor representations. Vector-based methods can destroy the original structural information and correlations in high-dimensional sensor data, lead to the curse of dimensionality, and leave some outliers undetected. To solve this problem, support high-order tensor data description (STDD) and kernel support high-order tensor data description (KSTDD) are proposed to detect outliers in tensor data. STDD and KSTDD extend support vector data description from vector space to tensor space. KSTDD maintains the structural information of the data, avoids the problems caused by vectorizing tensor data, and improves outlier detection performance. Experiments on four sensor datasets show that the proposed method is superior to the traditional vectorized data analysis method. (C) 2017 Elsevier B.V. All rights reserved.
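The abstract lists CP factorization among its keywords and contrasts tensor representations with vectorization. As a rough illustration only (not the authors' STDD or KSTDD formulation), the sketch below shows one standard property that tensor-space methods can exploit: the inner product of two tensors held in CP form can be computed from their factor matrices alone, without building or flattening the dense tensors. The function names `cp_to_tensor` and `cp_inner`, the tensor shape, and the rank are all hypothetical choices made for this example.

```python
import numpy as np

def cp_to_tensor(factors):
    """Rebuild a dense 3-way tensor from CP factor matrices (each of shape I_n x R)."""
    A, B, C = factors
    return np.einsum("ir,jr,kr->ijk", A, B, C)

def cp_inner(fx, fy):
    """Inner product of two CP-form tensors using only their factor matrices.

    For X = sum_r a_r o b_r o c_r and Y = sum_s a'_s o b'_s o c'_s,
    <X, Y> = sum_{r,s} <a_r, a'_s> <b_r, b'_s> <c_r, c'_s>.
    """
    gram = np.ones((fx[0].shape[1], fy[0].shape[1]))
    for a, b in zip(fx, fy):
        gram *= a.T @ b  # mode-wise Gram matrix, combined element-wise
    return gram.sum()

# Hypothetical sizes: a 4 x 5 x 6 tensor of CP rank 2.
rng = np.random.default_rng(0)
shape, rank = (4, 5, 6), 2
fx = [rng.standard_normal((n, rank)) for n in shape]
fy = [rng.standard_normal((n, rank)) for n in shape]

# Reference value from the dense tensors versus the factor-only computation.
dense = float(np.sum(cp_to_tensor(fx) * cp_to_tensor(fy)))
factored = float(cp_inner(fx, fy))
print(np.isclose(dense, factored))
```

A squared distance between two CP-form tensors follows from the same routine as `cp_inner(fx, fx) - 2 * cp_inner(fx, fy) + cp_inner(fy, fy)`, which is the kind of quantity a kernel-based data description would need; whether the paper's kernel is built this way is not stated in the abstract.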
Pages: 177-187
Number of pages: 11
Related papers
50 in total
  • [21] Optimal Sparse Singular Value Decomposition for High-Dimensional High-Order Data
    Zhang, Anru
    Han, Rungang
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (528) : 1708 - 1725
  • [22] Adaptive Clustering for Outlier Identification in High-Dimensional Data
    Thudumu, Srikanth
    Branch, Philip
    Jin, Jiong
    Singh, Jugdutt
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2019, PT II, 2020, 11945 : 215 - 228
  • [23] OUTLIER DETECTION WITH ENHANCED ANGLE-BASED OUTLIER FACTOR IN HIGH-DIMENSIONAL DATA STREAM
    Shou, Zhaoyu
    Tian, Hao
    Li, Simin
    Zou, Fengbo
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2018, 14 (05): : 1633 - 1651
  • [24] Outlier mining in large high-dimensional data sets
    Angiulli, F
    Pizzuti, C
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (02) : 203 - 215
  • [25] Tensor Quantization: High-Dimensional Data Compression
    Chang, Shih Yu
    Wu, Hsiao-Chun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (08) : 5566 - 5580
  • [26] Metric Learning for High-Dimensional Tensor Data
    Shi Jiarong
    Jiao Licheng
    Shang Fanhua
    CHINESE JOURNAL OF ELECTRONICS, 2011, 20 (03): : 495 - 498
  • [27] Weighted Outlier Detection of High-Dimensional Categorical Data Using Feature Grouping
    Li, Junli
    Zhang, Jifu
    Pang, Ning
    Qin, Xiao
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11): : 4295 - 4308
  • [28] IPMOD: An efficient outlier detection model for high-dimensional medical data streams
    Yang, Yun
    Fan, ChongJun
    Chen, Liang
    Xiong, HongLin
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 191
  • [29] Computationally Efficient Outlier Detection for High-Dimensional Data Using the MDP Algorithm
    Tsagris, Michail
    Papadakis, Manos
    Alenazi, Abdulaziz
    Alzeley, Omar
    COMPUTATION, 2024, 12 (09)
  • [30] An Unbiased Distance-Based Outlier Detection Approach for High-Dimensional Data
    Hoang Vu Nguyen
    Gopalkrishnan, Vivekanand
    Assent, Ira
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT I, 2011, 6587 : 138 - +