APT Attribution for Malware Based on Time Series Shapelets

被引:3
|
作者
Wang, Qinqin [1 ,2 ]
Yan, Hanbing [3 ]
Zhao, Chang [4 ]
Mei, Rui [1 ,2 ]
Han, Zhihui [3 ]
Zhou, Yu [3 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Coordinat Ctr China, Natl Comp Network Emergency Response Tech Team, Beijing, Peoples R China
[4] Beijing ChaitinTechnol Co Ltd, Beijing, Peoples R China
基金
国家重点研发计划;
关键词
D O I
10.1109/TrustCom56396.2022.00108
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To discover and defend against APT attacks more efficiently, we need to conduct binary analysis and source tracing research on APT malicious codes. This paper attributes APT groups for malicious codes from the perspective of binary similarity. First, we innovatively select the local features of the binary functions for classification and apply time series mining techniques to the mining of sequences of basic blocks (called paths). The Shapelet model selects path shapelets, which are path fragments that can best represent paths and are used to distinguish paths. Path shapelets can provide path-level interpretability for classification. Second, we use API calls to filter functions and generate paths of interest to reduce resource consumption. To evaluate the proposed method, we collect APT malicious codes based on publicly available threat intelligence reports. Our method filters 92.82% of functions and generates an average of 1.37 paths per function. The classification effect has obvious advantages over other methods.
引用
下载
收藏
页码:769 / 777
页数:9
相关论文
共 50 条
  • [31] Time Series Shapelets Extraction via Similarity Join
    Zhang Z.
    Wang C.
    Wen Y.
    Yuan X.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (03): : 594 - 610
  • [32] Localized shapelets selection for interpretable time series classification
    Jiahui Chen
    Yuan Wan
    Applied Intelligence, 2023, 53 : 17985 - 18001
  • [33] Optimal shapelets tree for time series interpretable classification
    Bonasera, Lorenzo
    Gualandi, Stefano
    EURO JOURNAL ON COMPUTATIONAL OPTIMIZATION, 2024, 12
  • [34] On the Mining of the Minimal Set of Time Series Data Shapelets
    Boubrahimi, Soukaina Filali
    Hamdi, Shah Muhammad
    Ma, Ruizhe
    Angryk, Rafal
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 493 - 502
  • [35] Clustering Time Series using Unsupervised-Shapelets
    Zakaria, Jesin
    Mueen, Abdullah
    Keogh, Eamonn
    12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 785 - 794
  • [36] SE-shapelets: Semi-supervised Clustering of Time Series Using Representative Shapelets
    Cai, Borui
    Huang, Guangyan
    Yang, Shuiqiao
    Xiang, Yong
    Chi, Chi-Hung
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 240
  • [37] Points2Shapelets: A Salience-Guided Shapelets Selection Approach to Time Series Classification
    Feng, Guanxi
    Ma, Chao
    Zhou, Linjiang
    Wu, Libing
    Zhang, Jingsheng
    Shi, Xiaochuan
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [38] Fast and space-efficient shapelets-based time-series classification
    Gordona, Daniel
    Hendler, Danny
    Rokach, Lior
    INTELLIGENT DATA ANALYSIS, 2015, 19 (05) : 953 - 981
  • [39] W-TSS: A Wavelet-Based Algorithm for Discovering Time Series Shapelets
    Li, Kenan
    Deng, Huiyu
    Morrison, John
    Habre, Rima
    Franklin, Meredith
    Chiang, Yao-Yi
    Sward, Katherine
    Gilliland, Frank D.
    Ambite, Jose Luis
    Eckel, Sandrah P.
    SENSORS, 2021, 21 (17)
  • [40] Optimizing shapelets quality measure for imbalanced time series classification
    Qiuyan Yan
    Yang Cao
    Applied Intelligence, 2020, 50 : 519 - 536