Outlier Detection on Uncertain Data: Objects, Instances, and Inferences

被引:0
|
作者
Jiang, Bin [1 ]
Pei, Jian [1 ]
机构
[1] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies the problem of outlier detection on uncertain data. We start with a comprehensive model considering both uncertain objects and their instances. An uncertain object has some inherent attributes and consists of a set of instances which are modeled by a probability density distribution. We detect outliers at both the instance level and the object level. To detect outlier instances, it is a prerequisite to know normal instances. By assuming that uncertain objects with similar properties tend to have similar instances, we learn the normal instances for each uncertain object using the instances of objects with similar properties. Consequently, outlier instances can be detected by comparing against normal ones. Furthermore, we can detect outlier objects most of whose instances are outliers. Technically, we use a Bayesian inference algorithm to solve the problem, and develop an approximation algorithm and a filtering algorithm to speed up the computation. An extensive empirical study on both real data and synthetic data verifies the effectiveness and efficiency of our algorithms.
引用
收藏
页码:422 / 433
页数:12
相关论文
共 50 条
  • [1] Uncertain distance-based outlier detection with arbitrarily shaped data objects
    Fabrizio Angiulli
    Fabio Fassetti
    [J]. Journal of Intelligent Information Systems, 2021, 57 : 1 - 24
  • [2] Uncertain distance-based outlier detection with arbitrarily shaped data objects
    Angiulli, Fabrizio
    Fassetti, Fabio
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2021, 57 (01) : 1 - 24
  • [3] Outlier Detection on Uncertain Data Streams
    Zhu, Bin
    Zhong, Yuling
    Wang, Xite
    Bai, Mei
    [J]. Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 2020, 47 (02): : 134 - 140
  • [4] Parallel outlier detection on uncertain data for GPUs
    Matsumoto, Takazumi
    Hung, Edward
    Yiu, Man Lung
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2015, 33 (03) : 417 - 447
  • [5] Parallel outlier detection on uncertain data for GPUs
    Takazumi Matsumoto
    Edward Hung
    Man Lung Yiu
    [J]. Distributed and Parallel Databases, 2015, 33 : 417 - 447
  • [6] Continuous Outlier Detection on Uncertain Data Streams
    Shaikh, Salman Ahmed
    Kitagawa, Hiroyuki
    [J]. 2014 IEEE NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT SENSORS, SENSOR NETWORKS AND INFORMATION PROCESSING (IEEE ISSNIP 2014), 2014,
  • [7] Distance-based outlier detection on uncertain data
    Yu, Hao
    Wang, Bin
    Xiao, Gang
    Yang, Xiaochun
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2010, 47 (03): : 474 - 484
  • [8] SVDD-based outlier detection on uncertain data
    Bo Liu
    Yanshan Xiao
    Longbing Cao
    Zhifeng Hao
    Feiqi Deng
    [J]. Knowledge and Information Systems, 2013, 34 : 597 - 618
  • [9] SVDD-based outlier detection on uncertain data
    Liu, Bo
    Xiao, Yanshan
    Cao, Longbing
    Hao, Zhifeng
    Deng, Feiqi
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 34 (03) : 597 - 618
  • [10] Outlier detection on uncertain data based on local information
    Liu, Jing
    Deng, HuiFang
    [J]. KNOWLEDGE-BASED SYSTEMS, 2013, 51 : 60 - 71