Outlier detection from large distributed databases

被引:0
|
作者
Ji Zhang
Xiaohui Tao
Hua Wang
机构
[1] University of Southern Queensland,Department of Mathematics and Computing
来源
World Wide Web | 2014年 / 17卷
关键词
Data mining; Distributed database; Outlier detection;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we present an innovative system, coined as DISTROD (a.k.a DISTRibuted Outlier Detector), for detecting outliers, namely abnormal instances or observations, from multiple large distributed databases. DISTROD is able to effectively detect the so-called global outliers from distributed databases that are consistent with those produced by the centralized detection paradigm. DISTROD is equipped with a number of optimization/boosting strategies which empower it to significantly enhance its speed performance and reduce its communication overhead. Experimental evaluation demonstrates the good performance of DISTROD in terms of speed and communication overhead.
引用
收藏
页码:539 / 568
页数:29
相关论文
共 50 条
  • [1] Outlier detection from large distributed databases
    Zhang, Ji
    Tao, Xiaohui
    Wang, Hua
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2014, 17 (04): : 539 - 568
  • [2] A novel Outlier Detection Algorithm for Distributed Databases
    Zhou, Jiaogen
    Zhao, Chunjiang
    Wan, You
    Huang, Wenjiang
    Yang, Baozhu
    Ge, Jixin
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 5, PROCEEDINGS, 2008, : 293 - +
  • [3] Spatio-temporal outlier detection in large databases
    Dokuz Eylul University, Department of Computer Engineering, Izmir
    35100, Turkey
    J. Compt. Inf. Technol., 2006, 4 (291-297):
  • [4] Spatio-temporal outlier detection in large databases
    Birant, Derya
    Kut, Alp
    ITI 2006: PROCEEDINGS OF THE 28TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2006, : 179 - +
  • [5] A distributed algorithm for outlier detection in a large database
    Sarker, BK
    Kitagawa, H
    DATABASES IN NETWORKED INFORMATION SYSTEMS, PROCEEDINGS, 2005, 3433 : 300 - 309
  • [6] Multivariate outlier detection and remediation in geochemical databases
    Lalor, GC
    Zhang, CS
    SCIENCE OF THE TOTAL ENVIRONMENT, 2001, 281 (1-3) : 99 - 109
  • [7] An Efficient Algorithm for Distributed Outlier Detection in Large Multi-Dimensional Datasets
    Wang, Xi-Te
    Shen, De-Rong
    Bai, Mei
    Nie, Tie-Zheng
    Kou, Yue
    Yu, Ge
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2015, 30 (06) : 1233 - 1248
  • [8] An Efficient Algorithm for Distributed Outlier Detection in Large Multi-Dimensional Datasets
    Xi-Te Wang
    De-Rong Shen
    Mei Bai
    Tie-Zheng Nie
    Yue Kou
    Ge Yu
    Journal of Computer Science and Technology, 2015, 30 : 1233 - 1248
  • [9] Adaptive Distributed Outlier Detection for WSNs
    De Paola, Alessandra
    Gaglio, Salvatore
    Lo Re, Giuseppe
    Milazzo, Fabrizio
    Ortolani, Marco
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (05) : 888 - 899
  • [10] SPARX: Distributed Outlier Detection at Scale
    Zhang, Sean
    Ursekar, Varun
    Akoglu, Leman
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4530 - 4540