A Universal Malicious Documents Static Detection Framework Based on Feature Generalization

被引：10

作者：

Lu, Xiaofeng ^{[1
]}

Wang, Fei ^{[1
]}

Jiang, Cheng ^{[1
]}

Lio, Pietro ^{[2
]}

机构：

[1] Beijing Univ Posts & Telecommun, Sch Cyberspace Secur, Beijing 100876, Peoples R China

[2] Univ Cambridge, Comp Lab, Cambridge CB3 0FD, England

来源：

APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 24期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

malicious document detection; static detection; feature generalization; machine learning;

D O I：

10.3390/app112412134

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

In this study, Portable Document Format (PDF), Word, Excel, Rich Test format (RTF) and image documents are taken as the research objects to study a static and fast method by which to detect malicious documents. Malicious PDF and Word document features are abstracted and extended, which can be used to detect other types of documents. A universal static detection framework for malicious documents based on feature generalization is then proposed. The generalized features include specification check errors, the structure path, code keywords, and the number of objects. The proposed method is verified on two datasets, and is compared with Kaspersky, NOD32, and McAfee antivirus software. The experimental results demonstrate that the proposed method achieves good performance in terms of the detection accuracy, runtime, and scalability. The average F1-score of all types of documents is found to be 0.99, and the average detection time of a document is 0.5926 s, which is at the same level as the compared antivirus software.

引用

页数：23

共 50 条

[21] XAI-PDF: A Robust Framework for Malicious PDF Detection Leveraging SHAP-Based Feature Engineering
Al-Fayoumi, Mustafa
Abu Al-Haija, Qasem
Armoush, Rakan
Amareen, Christine
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2024, 21 (01) : 128 - 146
[22] SFEM: Structural feature extraction methodology for the detection of malicious office documents using machine learning methods
Cohen, Aviad
Nissim, Nir
Rokach, Lior
Elovici, Yuval
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 63 : 324 - 343
[23] A Framework for Malicious Domain Names Detection Using Feature Selection and Majority Voting Approach
Patil, Dharmaraj R.
Informatica (Slovenia), 2024, 48 (03): : 419 - 438
[24] Simulation on static detection of malicious code based on behavior information gain
Wei, Pengcheng
Shi, Chengxiang
He, Fangcheng
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (06) : 7683 - 7692
[25] Enhancing Malicious URL Detection: A Novel Framework Leveraging Priority Coefficient and Feature Evaluation
Rafsanjani, Ahmad Sahban
Binti Kamaruddin, Norshaliza
Behjati, Mehran
Aslam, Saad
Sarfaraz, Aaliya
Amphawan, Angela
IEEE ACCESS, 2024, 12 : 85001 - 85026
[26] Evaluations of AI-based malicious PowerShell detection with feature optimizations
Song, Jihyeon
Kim, Jungtae
Choi, Sunoh
Kim, Jonghyun
Kim, Ikkyun
ETRI JOURNAL, 2021, 43 (03) : 549 - 560
[27] DDOFM: Dynamic malicious domain detection method based on feature mining
Wang H.
Tang Z.
Li H.
Zhang J.
Cai C.
Computers and Security, 2023, 130
[28] Malicious Accounts Detection in Online Social Networks Based on Feature Extraction
Yuan, Deyu
Chen, Shicong
Huang, Shuhua
Ye, Han
Sun, Haichun
BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 127 : 107 - 108
[29] Fast Model Learning for the Detection of Malicious Digital Documents
Scofield, Daniel
Miles, Craig
Kuhn, Stephen
PROCEEDINGS OF THE 7TH SOFTWARE SECURITY, PROTECTION, AND REVERSE ENGINEERING WORKSHOP 2017 (SSPREW), 2017,
[30] Neural α-feature detector for feature detection and generalization
Kamimura, R
Kanagawa, H
IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, 1998, : 1845 - 1850

← 1 2 3 4 5 →