Malware Detection Using Byte Streams of Different File Formats

Authors
Jeong, Young-Seob [1]
Lee, Sang-Min [2]
Kim, Jong-Hyun [2]
Woo, Jiyoung [3]
Kang, Ah Reum [4]
Affiliations
[1] Chungbuk Natl Univ, Dept Comp Engn, Cheongju 28644, South Korea
[2] Elect & Telecommun Res Inst, Daejeon 34129, South Korea
[3] Soonchunhyang Univ, Dept Big Data Engn, Asan 31538, South Korea
[4] Pai Chai Univ, Dept Informat Secur, Daejeon 35345, South Korea
Keywords
Malware; Task analysis; Portable document format; Training; Analytical models; Support vector machines; Numerical models; Malware detection; byte stream; non-executables; deep learning; convolutional neural networks; Hangul word processor
DOI
10.1109/ACCESS.2022.3171775
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Malware detection is becoming a more important task as the volume of data on the Internet grows. Web users are vulnerable to non-executable files such as Word files and Hangul Word Processor files because they usually open such files without paying attention. As new infected non-executables keep appearing, deep-learning models are drawing attention because they are known to be effective and to generalize better. In particular, deep-learning models have been used to learn arbitrary patterns from byte streams, and they have performed well on the malware detection task. Although there have been malware detection studies using deep-learning models, they commonly targeted a single file format and did not consider using different formats together. In this paper, we assume that different file formats may contribute to each other, so that deep-learning models have a better chance of learning more promising patterns and achieving better performance. We show experimentally that this assumption holds, using our annotated datasets of two different file formats: Portable Document Format (PDF) and Hangul Word Processor (HWP).
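To make the idea of pooling byte streams from different file formats concrete, the following is a minimal sketch and not the authors' pipeline: the abstract does not describe the actual preprocessing. The names `MAX_LEN`, `PAD`, `to_fixed_length`, and `build_mixed_dataset`, the toy byte strings, and the labels are all hypothetical. It only illustrates that, once files are reduced to raw bytes, samples from two formats can share one fixed-length integer representation suitable as input to a byte-level model.

```python
from typing import Iterable, List, Tuple

MAX_LEN = 16  # hypothetical input length; a real model would read far more bytes
PAD = 256     # hypothetical padding token, chosen outside the 0..255 byte range


def to_fixed_length(stream: bytes, max_len: int = MAX_LEN) -> List[int]:
    """Truncate or right-pad a byte stream to exactly max_len integer tokens."""
    tokens = list(stream[:max_len])  # iterating over bytes yields ints in 0..255
    return tokens + [PAD] * (max_len - len(tokens))


def build_mixed_dataset(
    samples: Iterable[Tuple[bytes, int]],
) -> Tuple[List[List[int]], List[int]]:
    """Pool (byte stream, label) pairs from any file format into one dataset."""
    features: List[List[int]] = []
    labels: List[int] = []
    for stream, label in samples:
        features.append(to_fixed_length(stream))
        labels.append(label)
    return features, labels


# Toy placeholder byte strings and labels (0 = benign, 1 = malicious), not real data.
pdf_samples = [(b"%PDF-1.7 stream", 0)]
hwp_samples = [(b"\xd0\xcf\x11\xe0", 1)]

# Both formats end up in the same token space, so one model can train on the union.
features, labels = build_mixed_dataset(pdf_samples + hwp_samples)
```

Because the representation ignores the file format entirely, adding a third format in this sketch would only require appending its samples to the list passed to `build_mixed_dataset`.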
Pages: 51041-51047
Page count: 7