A semantics-based approach to malware detection

被引:39
|
作者
Preda, Mila Dalla [1 ]
Christodorescu, Mihai [2 ]
Jha, Somesh [2 ]
Debray, Saumya [3 ]
机构
[1] Univ Verona, I-37100 Verona, Italy
[2] Univ Wisconsin, Madison, WI 53706 USA
[3] Univ Arizona, Tucson, AZ USA
基金
美国国家科学基金会;
关键词
security; languages; theory; verification; malware detection; obfuscation; trace semantics; abstract interpretation;
D O I
10.1145/1387673.1387674
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Malware detection is a crucial aspect of software security. Current malware detectors work by checking for signatures, which attempt to capture the syntactic characteristics of the machine-level byte sequence of the malware. This reliance on a syntactic approach makes current detectors vulnerable to code obfuscations, increasingly used by malware writers, that alter the syntactic properties of the malware byte sequence without significantly affecting their execution behavior. This paper takes the position that the key to malware identification lies in their semantics. It proposes a semantics-based framework for reasoning about malware detectors and proving properties such as soundness and completeness of these detectors. Our approach uses a trace semantics to characterize the behavior of malware as well as that of the program being checked for infection, and uses abstract interpretation to "hide" irrelevant aspects of these behaviors. As a concrete application of our approach, we show that (1) standard signature matching detection schemes are generally sound but not complete, (2) the semantics-aware malware detector proposed by Christodorescu et al. is complete with respect to a number of common obfuscations used by malware writers and (3) the malware detection scheme proposed by Kinder et al. and based on standard model-checking techniques is sound in general and complete on some, but not all, obfuscations handled by the semantics-aware malware detector.
引用
收藏
页数:54
相关论文
共 50 条
  • [1] A semantics-based approach to Malware detection
    Preda, Mila Dalla
    Christodorescu, Mihai
    Jha, Somesh
    Debray, Saumya
    ACM SIGPLAN NOTICES, 2007, 42 (01) : 377 - 388
  • [2] A Semantics-Based Approach to Malware Detection
    Preda, Mila Dalla
    Christodorescu, Mihai
    Jha, Somesh
    Debray, Saumya
    CONFERENCE RECORD OF POPL 2007: THE 34TH ACM SIGPLAN SIGACT SYMPOSIUM ON PRINCIPLES OF PROGAMMING LANGUAGES, 2007, : 377 - 388
  • [3] A New Semantics-Based Android Malware Detection
    Zhang, Xiaohan
    Jin, Zhengping
    2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1412 - 1416
  • [4] Apposcopy: Semantics-Based Detection of Android Malware through Static Analysis
    Feng, Yu
    Anand, Saswat
    Dillig, Isil
    Aiken, Alex
    22ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (FSE 2014), 2014, : 576 - 587
  • [5] TextDroid: Semantics-based Detection of Mobile Malware Using Network Flows
    Wang, Shanshan
    Yan, Qiben
    Chen, Zhenxiang
    Yang, Bo
    Zhao, Chuan
    Conti, Mauro
    2017 IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2017, : 18 - 23
  • [6] Semantics-Based Online Malware Detection: Towards Efficient Real-Time Protection Against Malware
    Das, Sanjeev
    Liu, Yang
    Zhang, Wei
    Chandramohan, Mahintham
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2016, 11 (02) : 289 - 302
  • [7] A Semantics-Based Approach on Binary Function Similarity Detection
    Zhang, Yuntao
    Fang, Binxing
    Xiong, Zehui
    Wang, Yanhao
    Liu, Yuwei
    Zheng, Chao
    Zhang, Qinnan
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (15): : 25910 - 25924
  • [8] Searching the web: A semantics-based approach
    Cao, TH
    Nguyen, THD
    Qui, TCT
    MODELLING, SIMULATION AND OPTIMIZATION OF COMPLEX PROCESSES, 2005, : 57 - 68
  • [9] Memoized Semantics-Based Binary Diffing with Application to Malware Lineage Inference
    Ming, Jiang
    Xu, Dongpeng
    Wu, Dinghao
    ICT SYSTEMS SECURITY AND PRIVACY PROTECTION, 2015, 455 : 416 - 430
  • [10] A Semantics-Based Approach to Retrieving Biomedical Information
    Andreasen, Troels
    Bulskov, Henrik
    Zambach, Sine
    Lassen, Tine
    Madsen, Bodil Nistrup
    Jensen, Per Anker
    Thomsen, Hanne Erdman
    Nilsson, Jorgen Fischer
    FLEXIBLE QUERY ANSWERING SYSTEMS, 2011, 7022 : 108 - +