Malware classification using word embeddings algorithms and long-short term memory networks

被引:3
|
作者
Andrade, Eduardo de O. [1 ]
Viterbo, Jose [1 ]
Guerin, Joris [2 ]
Bernardini, Flavia [1 ]
机构
[1] Fluminense Fed Univ, Inst Comp, Rio De Janeiro, Brazil
[2] Toulouse Univ, LAAS CNRS, Midi Pyrenees, France
关键词
deep learning; malware classification; malware detection; recurrent neural network; word embedding; NEURAL-NETWORKS; LSTM;
D O I
10.1111/coin.12543
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The number of malicious software applications, or malware programs, increases every year. Their development becomes more sophisticated as new techniques are used to bypass program scanning software applications, such as antiviruses. Thereby, deep learning-based methods emerge as a new promising way to identify these threats. Our main purpose and contribution in this work is proposing and implementing a successful approach to tackle both binary and multiclass malware classification problems. We used unsupervised word embedding algorithms for representing software applications to be analyzed and long-short term memory for classifying the software applications. For evaluating our pipeline, we introduce a new dataset for binary and multiclass malware classification because we could not find large datasets containing sufficient samples of cleanware and the various malware types for multiclass classification that could be used to evaluate classification models. Our experimental results reached an accuracy of 88.94% for binary classification and 75.13% for multiclass classification. These results suggest that the proposed dataset is challenging, and using it can help in the training of better malware classifiers, improving security.
引用
收藏
页码:1802 / 1830
页数:29
相关论文
共 50 条
  • [1] Zero Shot Intent Classification Using Long-Short Term Memory Networks
    Williams, Kyle
    [J]. INTERSPEECH 2019, 2019, : 844 - 848
  • [2] Word embeddings and recurrent neural networks based on Long-Short Term Memory nodes in supervised biomedical word sense disambiguation
    Yepes, Antonio Jimeno
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 73 : 137 - 147
  • [3] Classification of coma etiology using convolutional neural networks and long-short term memory networks
    Baldo Junior, Sergio
    Carneiro, Murillo G.
    Destro-Filho, Joao-Batista
    Zhao, Liang
    Tinos, Renato
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [4] Malware Classification using Long Short-term Memory Models
    Dang, Dennis
    Di Troia, Fabio
    Stamp, Mark
    [J]. ICISSP: PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS SECURITY AND PRIVACY, 2021, : 743 - 752
  • [5] CONVOLUTIONAL LONG-SHORT TERM MEMORY NETWORKS MODEL FOR LONG DURATION EEG SIGNAL CLASSIFICATION
    Baloglu, Ulas Baran
    Yildirim, Ozal
    [J]. JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2019, 19 (01)
  • [6] Deep Learning Based Automatic Modulation Classification With Long-Short Term Memory Networks
    Karahan, Sumeye Nur
    Kalaycioglu, Aykut
    [J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [7] Photonic Long-Short Term Memory Neural Networks with Analog Memory
    Howard, Emma R.
    Marquez, Bicky A.
    Shastri, Bhavin J.
    [J]. 2020 IEEE PHOTONICS CONFERENCE (IPC), 2020,
  • [8] Twitter Bot Detection Using Bidirectional Long Short-term Memory Neural Networks and Word Embeddings
    Wei, Feng
    Uyen Trang Nguyen
    [J]. 2019 FIRST IEEE INTERNATIONAL CONFERENCE ON TRUST, PRIVACY AND SECURITY IN INTELLIGENT SYSTEMS AND APPLICATIONS (TPS-ISA 2019), 2019, : 101 - 109
  • [9] Individualized Location Prediction Using Autoencoders and Long-Short Term Memory Networks
    Onwujekwe, Gerald
    Men, Zibo
    Duke, Joseph
    [J]. 2024 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2024, 2024,
  • [10] Short Term Prediction of Wind Speed Based on Long-Short Term Memory Networks
    Salman, Umar T.
    Rehman, Shafiqur
    Alawode, Basit
    Alhems, Luai M.
    [J]. FME TRANSACTIONS, 2021, 49 (03): : 643 - 652