Anomaly Detection from Incomplete Data

被引:19
|
作者
Liu, Siyuan [1 ]
Chen, Lei [2 ]
Ni, Lionel M. [2 ]
机构
[1] Carnegie Mellon Univ, Heinz Coll, Pittsburgh, PA 15213 USA
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
基金
新加坡国家研究基金会;
关键词
Anomaly detection; mobile phone; correlation-based clustering; anomaly trajectory; EVENTS;
D O I
10.1145/2629668
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Anomaly detection (a.k.a., outlier or burst detection) is a well-motivated problem and a major data mining and knowledge discovery task. In this article, we study the problem of population anomaly detection, one of the key issues related to event monitoring and population management within a city. Through studying detected population anomalies, we can trace and analyze these anomalies, which could help to model city traffic design and event impact analysis and prediction. Although a significant and interesting issue, it is very hard to detect population anomalies and retrieve anomaly trajectories, especially given that it is difficult to get actual and sufficient population data. To address the difficulties of a lack of real population data, we take advantage of mobile phone networks, which offer enormous spatial and temporal communication data on persons. More importantly, we claim that we can utilize these mobile phone data to infer and approximate population data. Thus, we can study the population anomaly detection problem by taking advantages of unique features hidden in mobile phone data. In this article, we present a system to conduct Population Anomaly Detection (PAD). First, we propose an effective clustering method, correlation-based clustering, to cluster the incomplete location information from mobile phone data (i.e., from mobile call volume distribution to population density distribution). Then, we design an adaptive parameter-free detection method, R-scan, to capture the distributed dynamic anomalies. Finally, we devise an efficient algorithm, BT-miner, to retrieve anomaly trajectories. The experimental results from real-life mobile phone data confirm the effectiveness and efficiency of the proposed algorithms. Finally, the proposed methods are realized as a pilot system in a city in China.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Network anomaly detection with incomplete audit data
    Patcha, Animesh
    Park, Jung-Min
    [J]. COMPUTER NETWORKS, 2007, 51 (13) : 3935 - 3955
  • [2] Research abstract for semantic anomaly detection in dynamic data feeds with incomplete specifications
    Raz, O
    [J]. ICSE 2002: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, 2002, : 733 - 734
  • [3] A Deep Similarity Metric Method Based on Incomplete Data for Traffic Anomaly Detection in IoT
    Kang, Xu
    Song, Bin
    Sun, Fengyao
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (01):
  • [4] (1 + Ε)-class classification: An anomaly detection method for highly imbalanced or incomplete data sets
    Borisyak, Maxim
    Ryzhikov, Artem
    Ustyuzhanin, Andrey
    Derkach, Denis
    Ratnikov, Fedor
    Mineeva, Olga
    [J]. 1600, Microtome Publishing (21):
  • [5] Anomaly Detection from Call Data Records
    Nithi
    Dey, Lipika
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2009, 5909 : 237 - 242
  • [6] Time Series Data Cleaning: From Anomaly Detection to Anomaly Repairing
    Zhang, Aoqian
    Song, Shaoxu
    Wang, Jianmin
    Yu, Philip S.
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (10): : 1046 - 1057
  • [7] Unsupervised Anomaly Detection for Multivariate Incomplete Data using GAN-based Data Imputation: A Comparative Study
    Sarda, Kisan
    Yerudkar, Amol
    Del Vecchio, Carmen
    [J]. 2023 31ST MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION, MED, 2023, : 55 - 60
  • [8] (1+ε)-class Classification: an Anomaly Detection Method for Highly Imbalanced or Incomplete Data Sets
    Borisyak, Maxim
    Ryzhikov, Artem
    Ustyuzhanin, Andrey
    Derkach, Denis
    Ratnikov, Fedor
    Mineeva, Olga
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [9] Anomaly detection in big data from UWB radars
    Wang, Wei
    Zhou, Xin
    Zhang, Baoju
    Mu, Jiasong
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2015, 8 (14) : 2469 - 2475
  • [10] Correlated Anomaly Detection from Large Streaming Data
    Chen, Zheng
    Yu, Xinli
    Ling, Yuan
    Song, Bo
    Quan, Wei
    Hu, Xiaohua
    Yan, Erjia
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 982 - 992