Outlier detection from large distributed databases

被引:0
|
作者
Ji Zhang
Xiaohui Tao
Hua Wang
机构
[1] University of Southern Queensland,Department of Mathematics and Computing
来源
World Wide Web | 2014年 / 17卷
关键词
Data mining; Distributed database; Outlier detection;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we present an innovative system, coined as DISTROD (a.k.a DISTRibuted Outlier Detector), for detecting outliers, namely abnormal instances or observations, from multiple large distributed databases. DISTROD is able to effectively detect the so-called global outliers from distributed databases that are consistent with those produced by the centralized detection paradigm. DISTROD is equipped with a number of optimization/boosting strategies which empower it to significantly enhance its speed performance and reduce its communication overhead. Experimental evaluation demonstrates the good performance of DISTROD in terms of speed and communication overhead.
引用
收藏
页码:539 / 568
页数:29
相关论文
共 50 条
  • [21] Outlier detection in large data sets
    Buzzi-Ferraris, Guido
    Manenti, Flavio
    COMPUTERS & CHEMICAL ENGINEERING, 2011, 35 (02) : 388 - 390
  • [22] Distributed algorithm for mining multilevel association rules from large databases
    Wang, Chunhua
    Huang, Houkuan
    Tian, Shengfeng
    Wang, Zhihai
    Tiedao Xuebao/Journal of the China Railway Society, 2000, 22 (05): : 47 - 50
  • [23] PARALLEL CYCLE DETECTION IN DISTRIBUTED DATABASES
    SCHALL, E
    INFORMATION SYSTEMS, 1990, 15 (05) : 555 - 566
  • [24] DEADLOCK DETECTION IN DISTRIBUTED DATABASES.
    Knapp, Edgar
    Computing surveys, 1987, 19 (04): : 303 - 328
  • [25] DETECTION OF MUTUAL INCONSISTENCY IN DISTRIBUTED DATABASES
    RAMARAO, KVS
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1989, 6 (03) : 498 - 514
  • [26] Parallel Outlier Detection Algorithm in Heterogeneous Distributed Environment
    Wang X.
    Zhu Z.
    Yu X.
    Bai M.
    Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 2020, 47 (10): : 100 - 110
  • [27] Cell-based outlier detection algorithm: A fast outlier detection algorithm for large datasets
    Wan, You
    Bian, Fuling
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 1042 - 1048
  • [28] A Cost Effective Algorithm for Outlier Detection in Distributed Systems
    Devi, R. Delshi Howsalya
    Devi, M. Indra
    2012 IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2012, : 37 - 40
  • [29] Continuous adaptive outlier detection on distributed data streams
    Su, Liang
    Han, Weihong
    Yang, Shuqiang
    Zou, Peng
    Jia, Yan
    HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2007, 4782 : 74 - 85
  • [30] Outlier Detection for Large Scale Manufacturing Processes
    Jauhri, Abhinav
    McDanel, Bradley
    Connor, Chris
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 2771 - 2774