Multi-View Malicious Document Detection

被引:6
|
作者
Lin, Jing-Yao [1 ]
Pao, Hsing-Kuo [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Comp Sci & Informat Engn, Taipei, Taiwan
关键词
malicious document; multi-view; exploit; vulnerability; PDF;
D O I
10.1109/TAAI.2013.43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Malicious document is one of the most notorious components of modern attacks. The document may appear normal in its format, but behave strangely or beyond users' expectation, sometimes lead to severe consequences when it is opened. Detecting malicious documents tops one of the most important tasks in modern information security. Malicious documents usually contain specific control codes inside which may cause the malicious shellcode be executed automatically. The document control code is originally designed to enrich the documents' functionalities; but in this case, it may create vulnerabilities and then become a key to trigger attacks. Detecting control codes of certain pattern is a key to the success of malicious document detection. Different from previous research that was focused on detecting malicious documents of a particular format or containing specific control codes, we propose a method that analyzes the document objects from three general views: the use of functional words, preference words, and constant data. The functional words control how an attack is launched, and through what actions, if the document is considered a malicious one; the preference words usually suggest the favored word choices from document authors; and the constant data can be considered the bullets to complete the attack. We also propose a TF-IDF method to normalize the features to detect documents with mimicry attacks. Overall, given the three feature views, we detect malicious documents under a classification framework. We evaluate the proposed approach through series of experiments that use different view combinations for prediction, followed by some comparison of the proposed method to related work.
引用
收藏
页码:170 / 175
页数:6
相关论文
共 50 条
  • [1] Malicious Word Document Detection Based On Multi-View Features Learning
    Lu Xiaofeng
    Wang Fei
    Shu Zifeng
    [J]. 2019 28TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND NETWORKS (ICCCN), 2019,
  • [2] Multi-view encryption malicious traffic detection method combined with co-training
    Huo, Yuehua
    Wu, Wenhao
    Zhao, Faqi
    Wang, Qiang
    [J]. Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2023, 50 (04): : 139 - 147
  • [3] A Multi-view Graph Learning Approach for Host-Based Malicious Behavior Detection
    Zhao, Chenfei
    Zhang, Zhe
    Wu, Tiejun
    Fan, Dunqiu
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS. DASFAA 2023 INTERNATIONAL WORKSHOPS, BDMS 2023, BDQM 2023, GDMA 2023, BUNDLERS 2023, 2023, 13922 : 283 - 299
  • [4] Graph based encrypted malicious traffic detection with hybrid analysis of multi-view features
    Hong, Yueping
    Li, Qi
    Yang, Yanqing
    Shen, Meng
    [J]. INFORMATION SCIENCES, 2023, 644
  • [5] ECNet: Robust Malicious Network Traffic Detection With Multi-View Feature and Confidence Mechanism
    Han, Xueying
    Liu, Song
    Liu, Junrong
    Jiang, Bo
    Lu, Zhigang
    Liu, Baoxu
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 6871 - 6885
  • [6] Multi-view document rectification using boundary
    Tsoi, Yau-Chat
    Brown, Michael S.
    [J]. 2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 2158 - +
  • [7] A multi-view context-aware approach to Android malware detection and malicious code localization
    Annamalai Narayanan
    Mahinthan Chandramohan
    Lihui Chen
    Yang Liu
    [J]. Empirical Software Engineering, 2018, 23 : 1222 - 1274
  • [8] A multi-view context-aware approach to Android malware detection and malicious code localization
    Narayanan, Annamalai
    Chandramohan, Mahinthan
    Chen, Lihui
    Liu, Yang
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2018, 23 (03) : 1222 - 1274
  • [9] Multi-view Document Classification with Co-training
    Sevim, Semih
    Ekinci, Ekin
    Ilhan Omurca, Sevinc
    [J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [10] A Novel Multi-view Object Class Detection Framework for Document Image Content Analysis
    Yin, Weichong
    Lu, Tong
    Su, Feng
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1095 - 1099