Learning from Time Series with Outlier Correction for Malicious Domain Identification

Times cited: 0
Authors
Tan, Guolin [1, 2]
Zhang, Peng [1]
Zhang, Lei [1, 2]
Zhang, Yu [1, 2]
Zhang, Chuang [1]
Liu, Qingyun [1]
Liu, Xinran [3]
Affiliations
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[3] Natl Comp Network Emergency Response & Coordinat, Beijing, Peoples R China
DOI
10.1109/ISSREW.2019.00040
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Malicious domain identification is an important task in cyberspace security. However, most existing work on this task relies heavily on expert experience when constructing machine learning features. Worse, attackers can deliberately change these features, so such identification methods are easily bypassed by cyber criminals. To solve this problem, we propose a novel method for malicious domain identification that learns time series shapelets, the discriminative local patterns of time series. Our method consists of two main components: 1) a model of users' domain-access habits, built by learning shapelets from domain time series. Because a domain's time series is generated by the crowd visiting the website, the learned access habits can potentially reflect what type of service the domain provides, such as pornography or gambling; 2) an outlier correction algorithm that operates on a single time series, is independent of the model, and enhances the robustness of shapelet initialization. We integrate shapelet learning and outlier correction in our model. Extensive experiments on a real-world dataset demonstrate that the proposed method outperforms state-of-the-art methods.
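The two components named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no algorithmic details, so the function names `shapelet_distance` and `correct_outliers`, the sliding-window mean squared distance, and the median/MAD replacement rule with a threshold of 3.5 are all assumptions chosen for illustration only.

```python
import numpy as np


def shapelet_distance(series, shapelet):
    """Smallest mean squared distance between the shapelet and any
    same-length window of the series (a common shapelet feature;
    assumed here, not taken from the paper)."""
    series = np.asarray(series, dtype=float)
    shapelet = np.asarray(shapelet, dtype=float)
    # Every contiguous window of len(shapelet) points, one per row.
    windows = np.lib.stride_tricks.sliding_window_view(series, len(shapelet))
    return float(np.min(np.mean((windows - shapelet) ** 2, axis=1)))


def correct_outliers(series, threshold=3.5):
    """Replace points whose modified z-score exceeds the threshold with the
    series median. Works on a single series and does not depend on any
    model. The median/MAD rule is an illustrative assumption."""
    series = np.asarray(series, dtype=float)
    median = np.median(series)
    mad = np.median(np.abs(series - median))
    if mad == 0:
        # No spread to measure against; leave the series unchanged.
        return series.copy()
    modified_z = 0.6745 * (series - median) / mad
    corrected = series.copy()
    corrected[np.abs(modified_z) > threshold] = median
    return corrected


# Hypothetical hourly visit counts for one domain, with one spike.
visits = [1, 2, 3, 2, 1, 100, 2, 3]
cleaned = correct_outliers(visits)
feature = shapelet_distance(cleaned, [1, 2, 3])
```

In this sketch the correction runs before any shapelet is matched, so a single spike cannot dominate the distance feature. That mirrors the abstract's stated motivation of making shapelet initialization more robust, without claiming to reproduce how the paper achieves it.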
Pages: 42-46
Page count: 5
Related papers
50 items in total
  • [11] Time series outlier detection and imputation
    Akouemo, Hermine N.
    Povinelli, Richard J.
    2014 IEEE PES GENERAL MEETING - CONFERENCE & EXPOSITION, 2014,
  • [12] ON OUTLIER DETECTION IN TIME-SERIES
    LJUNG, GM
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1993, 55 (02): : 559 - 567
  • [13] Outlier identification and correction for GRACE aggregated data
    Tourian, Mohammad J.
    Riegger, Johannes
    Sneeuw, Nico
    Devaraju, Balaji
    STUDIA GEOPHYSICA ET GEODAETICA, 2011, 55 (04) : 627 - 640
  • [14] Outlier identification and correction for GRACE aggregated data
    Mohammad J. Tourian
    Johannes Riegger
    Nico Sneeuw
    Balaji Devaraju
    Studia Geophysica et Geodaetica, 2011, 55 : 627 - 640
  • [15] Unsupervised feature extraction from multivariate time series for outlier detection
    Matsue, Kiyotaka
    Sugiyama, Mahito
    INTELLIGENT DATA ANALYSIS, 2022, 26 (06) : 1451 - 1467
  • [16] TIME-DOMAIN STRUCTURAL DAMAGE IDENTIFICATION: FROM A DICTIONARY LEARNING PERSPECTIVE
    Wang, Ying
    Zhang, Tong
    Hao, Hong
    PROCEEDINGS OF THE THIRTEENTH INTERNATIONAL SYMPOSIUM ON STRUCTURAL ENGINEERING, VOLS 1 AND II, 2014, : 1215 - 1222
  • [17] Malicious Text Identification: Deep Learning from Public Comments and Emails
    Baccouche, Asma
    Ahmed, Sadaf
    Sierra-Sosa, Daniel
    Elmaghraby, Adel
    INFORMATION, 2020, 11 (06)
  • [18] Outlier Impact Characterization for Time Series Data
    Li, Jianbo
    Zheng, Lecheng
    Zhu, Yada
    He, Jingrui
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11595 - 11603
  • [19] OUTLIER DETECTION AND TIME-SERIES MODELING
    ABRAHAM, B
    CHUANG, A
    TECHNOMETRICS, 1989, 31 (02) : 241 - 248
  • [20] Detection of outlier patches in autoregressive time series
    Justel, A
    Peña, D
    Tsay, RS
    STATISTICA SINICA, 2001, 11 (03) : 651 - 673