PDF Malware Detection Using Visualization and Machine Learning

被引:2
|
作者
Liu, Ching-Yuan [1 ]
Chiu, Min-Yi [2 ]
Huang, Qi-Xian [2 ]
Sun, Hung-Min [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan
[2] Natl Tsing Hua Univ, Inst Informat Syst & Applicat, Hsinchu, Taiwan
关键词
Malware detection; PDF malware; Malware visualization; Machine learning;
D O I
10.1007/978-3-030-81242-3_12
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, as more and more disasters caused by malware have been reported worldwide, people started to pay more attention to malware detection to prevent malicious attacks in advance. According to the diversity of the software platforms that people use, the malware also varies pretty much, for example: Xcode Ghost on iOS apps, FakePlayer on Android apps, and WannaCrypt on PC. Moreover, most of the time people ignore the potential security threats around us while surfing the internet, processing files or even reading email. The Portable Document Format (PDF) file, one of the most commonly used file types in the world, can be used to store texts, images, multimedia contents, and even scripts. However, with the increasing popularity and demands of PDF files, only a small fraction of people know how easy it could be to conceal malware in normal PDF files. In this paper, we propose a novel technique combining Malware Visualization and Image Classification to detect PDF files and identify which ones might be malicious. By extracting data from PDF files and traversing each object within, we can obtain the holistic treelike structure of PDF files. Furthermore, according to the signature of the objects in the files, we assign different colors obtained from SimHash to generate RGB images. Lastly, our proposed model trained by the VGG19 with CNN architecture achieved up to 0.973 accuracy and 0.975 F1-score to distinguish malicious PDF files, which is viable for personal, or enterprise-wide use and easy to implement.
引用
收藏
页码:209 / 220
页数:12
相关论文
共 50 条
  • [41] Malware Detection on Android Smartphones using API Class and Machine Learning
    Westyarian
    Rosmansyah, Yusep
    Dabarsyah, Budiman
    [J]. 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS 2015, 2015, : 294 - 297
  • [42] Zero-Day Malware Classification and Detection Using Machine Learning
    Kumar J.
    Rajendran B.
    Sudarsan S.D.
    [J]. SN Computer Science, 5 (1)
  • [43] A Survey on Android Malware Detection Techniques Using Machine Learning Algorithms
    Alqahtani, Ebtesam J.
    Zagrouba, Rachid
    Almuhaideb, Abdullah
    [J]. 2019 SIXTH INTERNATIONAL CONFERENCE ON SOFTWARE DEFINED SYSTEMS (SDS), 2019, : 110 - 117
  • [44] Permissions-Based Detection of Android Malware Using Machine Learning
    Akbar, Fahad
    Hussain, Mehdi
    Mumtaz, Rafia
    Riaz, Qaiser
    Wahab, Ainuddin Wahid Abdul
    Jung, Ki-Hyun
    [J]. SYMMETRY-BASEL, 2022, 14 (04):
  • [45] Machine Learning for Android Malware Detection Using Permission and API Calls
    Peiravian, Naser
    Zhu, Xingquan
    [J]. 2013 IEEE 25TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2013, : 300 - 305
  • [46] MLDroid—framework for Android malware detection using machine learning techniques
    Arvind Mahindru
    A. L. Sangal
    [J]. Neural Computing and Applications, 2021, 33 : 5183 - 5240
  • [47] Efficient and Effective Static Android Malware Detection Using Machine Learning
    Bansal, Vidhi
    Ghosh, Mohona
    Baliyan, Niyati
    [J]. INFORMATION SYSTEMS SECURITY, ICISS 2022, 2022, 13784 : 103 - 118
  • [48] Detection of malware in downloaded files using various machine learning models
    Kamboj, Akshit
    Kumar, Priyanshu
    Bairwa, Amit Kumar
    Joshi, Sandeep
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2023, 24 (01) : 81 - 94
  • [49] Analysis of Android Malware Detection Performance using Machine Learning Classifiers
    Ham, Hyo-Sik
    Choi, Mi-Jung
    [J]. 2013 INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2013): FUTURE CREATIVE CONVERGENCE TECHNOLOGIES FOR NEW ICT ECOSYSTEMS, 2013, : 492 - 497
  • [50] Android Mobile Malware Detection Using Machine Learning: A Systematic Review
    Senanayake, Janaka
    Kalutarage, Harsha
    Al-Kadri, Mhd Omar
    [J]. ELECTRONICS, 2021, 10 (13)