APT Attribution for Malware Based on Time Series Shapelets

被引:3
|
作者
Wang, Qinqin [1 ,2 ]
Yan, Hanbing [3 ]
Zhao, Chang [4 ]
Mei, Rui [1 ,2 ]
Han, Zhihui [3 ]
Zhou, Yu [3 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Coordinat Ctr China, Natl Comp Network Emergency Response Tech Team, Beijing, Peoples R China
[4] Beijing ChaitinTechnol Co Ltd, Beijing, Peoples R China
基金
国家重点研发计划;
关键词
D O I
10.1109/TrustCom56396.2022.00108
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To discover and defend against APT attacks more efficiently, we need to conduct binary analysis and source tracing research on APT malicious codes. This paper attributes APT groups for malicious codes from the perspective of binary similarity. First, we innovatively select the local features of the binary functions for classification and apply time series mining techniques to the mining of sequences of basic blocks (called paths). The Shapelet model selects path shapelets, which are path fragments that can best represent paths and are used to distinguish paths. Path shapelets can provide path-level interpretability for classification. Second, we use API calls to filter functions and generate paths of interest to reduce resource consumption. To evaluate the proposed method, we collect APT malicious codes based on publicly available threat intelligence reports. Our method filters 92.82% of functions and generates an average of 1.37 paths per function. The classification effect has obvious advantages over other methods.
引用
下载
收藏
页码:769 / 777
页数:9
相关论文
共 50 条
  • [41] Time Series Retrieval Using DTW-Preserving Shapelets
    Sperandio, Ricardo Carlini
    Malinowski, Simon
    Amsaleg, Laurent
    Tavenard, Romain
    SIMILARITY SEARCH AND APPLICATIONS, SISAP 2018, 2018, 11223 : 257 - 270
  • [42] Extracting diverse-shapelets for early classification on time series
    Wenhe Yan
    Guiling Li
    Zongda Wu
    Senzhang Wang
    Philip S. Yu
    World Wide Web, 2020, 23 : 3055 - 3081
  • [43] One-Class Learning Time-Series Shapelets
    Yamaguchi, Akihiro
    Nishikawa, Takeichiro
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2365 - 2372
  • [44] Effect of Mahalanobis Distance on Time Series Classification Using Shapelets
    Arathi, M.
    Govardhan, A.
    EMERGING ICT FOR BRIDGING THE FUTURE, VOL 2, 2015, 338 : 525 - 535
  • [45] Identifying APT Malware Domain Based on Mobile DNS Logging
    Niu, Weina
    Zhang, Xiaosong
    Yang, Guowu
    Zhu, Jianan
    Ren, Zhongwei
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2017, 2017
  • [46] Malware Triage Based on Static Features and Public APT Reports
    Laurenza, Giuseppe
    Aniello, Leonardo
    Lazzeretti, Riccardo
    Baldoni, Roberto
    CYBER SECURITY CRYPTOGRAPHY AND MACHINE LEARNING (CSCML 2017), 2017, 10332 : 288 - 305
  • [47] Random pairwise shapelets forest: an effective classifier for time series
    Yuan, Jidong
    Shi, Mohan
    Wang, Zhihai
    Liu, Haiyang
    Li, Jinyang
    KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (01) : 143 - 174
  • [48] Optimizing shapelets quality measure for imbalanced time series classification
    Yan, Qiuyan
    Cao, Yang
    APPLIED INTELLIGENCE, 2020, 50 (02) : 519 - 536
  • [49] Efficient Learning Interpretable Shapelets for Accurate Time Series Classification
    Fang, Zicheng
    Wang, Peng
    Wang, Wei
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 497 - 508
  • [50] Learning DTW-Shapelets for Time-Series Classification
    Shah, Mit
    Grabocka, Josif
    Schilling, Nicolas
    Wistuba, Martin
    Schmidt-Thieme, Lars
    PROCEEDINGS OF THE THIRD ACM IKDD CONFERENCE ON DATA SCIENCES (CODS), 2016,