A GPU Algorithm for Detecting Contextual Outliers in Multiple Concurrent Data Streams

被引:5
|
作者
Borah, Abinash [1 ]
Gruenwald, Le [1 ]
Leal, Eleazar [2 ]
Panjei, Egawati [1 ]
机构
[1] Univ Oklahoma, Sch Comp Sci, Norman, OK 73019 USA
[2] Univ Minnesota, Dept Comp Sci, Duluth, MN 55812 USA
来源
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2021年
基金
美国国家科学基金会;
关键词
Data Stream; Outlier Detection; Contextual Outlier; GPU;
D O I
10.1109/BigData52589.2021.9671460
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A data stream is an infinite sequence of data points generated from a source continuously at a fast rate, which is characterized by the transiency of the data points, the temporal relationship among the data points, concept drift, and multi-dimensionality of data points. Outlier detection in data streams thus needs to deal with the characteristics of Big Data applications such as volume, velocity, and variety. The problem of detecting outliers in multiple concurrent data streams introduces additional challenges to the problem. In this paper, we propose a parallel outlier detection technique CODS to detect Contextual Outliers in multiple concurrent independent multi-dimensional Data Streams using a Graphics Processing Unit (GPU). The proposed algorithm addresses all the aforesaid characteristics of data streams. A set of experiments demonstrates reasonable outlier detection accuracy and scalability of CODS with the number of data streams.
引用
收藏
页码:2737 / 2742
页数:6
相关论文
共 50 条
  • [1] Detecting Projected Outliers in High-Dimensional Data Streams
    Zhang, Ji
    Gao, Qigang
    Wang, Hai
    Liu, Qing
    Xu, Kai
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2009, 5690 : 629 - +
  • [2] Algorithm for Detecting Outliers in Bluetooth Data in Real Time
    Moghaddam, Soroush Salek
    Hellinga, Bruce
    TRANSPORTATION RESEARCH RECORD, 2014, (2442) : 129 - 139
  • [3] AN ALGORITHM OF DETECTING OUTLIERS IN SVR
    Zeng, Shaohua
    Tang, Yuanyan
    Wei, Yan
    Qin, Hanshu
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2012, 10 (05)
  • [4] DISTRO: A System for Detecting Global Outliers from Distributed Data Streams with Privacy Protection
    Zhang, Ji
    Dekeyser, Stijn
    Wang, Hua
    Shu, Yanfeng
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, PROCEEDINGS, 2010, 5982 : 477 - +
  • [5] Detecting Outliers in Data Streams Based on Minimum Rare Pattern Mining and Pattern Matching
    Li, Yun
    Cai, Saihua
    INFORMATION TECHNOLOGY AND CONTROL, 2022, 51 (02): : 268 - 282
  • [6] SPOT: A system for detecting projected outliers from high-dimensional data streams
    Zhang, Ji
    Gao, Qigang
    Wang, Hai
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1628 - +
  • [7] An efficient scheme for detecting phenomena in multiple data streams
    Salem, Thuraya Awadh
    Kamel, Ibrahim
    Al Aghbari, Zaher
    2007 INNOVATIONS IN INFORMATION TECHNOLOGIES, VOLS 1 AND 2, 2007, : 732 - 734
  • [8] EXOS: Explaining Outliers in Data Streams
    Panjei, Egawati
    Gruenwald, Le
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2023, 2023, 14148 : 25 - 41
  • [9] Wadjet: Finding Outliers in Multiple Multi-dimensional Heterogeneous Data Streams
    Sadik, Shiblee
    Gruenwald, Le
    Leal, Eleazar
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1232 - 1235
  • [10] Detecting spatial Outliers with multiple attributes
    Lu, CT
    Chen, DC
    Kou, YF
    15TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, : 122 - 128