Multi-labeling of complex, multi-behavioral malware samples

被引:2
|
作者
Garcia-Teodoro, P. [1 ]
Gomez-Hernandez, J. A. [1 ]
Abellan-Galera, A. [1 ]
机构
[1] Univ Granada, Network Engn & Secur Grp, Granada, Spain
关键词
Android; Behavior; Dataset; Labeling; Malware; CLASSIFICATION;
D O I
10.1016/j.cose.2022.102845
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The use of malware samples is usually required to test cyber security solutions. For that, the correct typology of the samples is of interest to properly estimate the exhibited performance of the tools under evaluation. Although several malware datasets are publicly available at present, most of them are not labeled or, if so, only one class or tag is assigned to each malware sample. We defend that just one label is not enough to represent the usual complex behavior exhibited by most of current malware. With this hypothesis in mind, and based on the varied classification generally provided by automatic detection engines per sample, we introduce here a simple multi-labeling approach to automatically tag the usual multiple behavior of malware samples. In the paper, we first analyze the coherence between the behaviors exhibited by a specific number of well-known malware samples dissected in the literature and the multiple tags provided for them by our labeling proposal. After that, the automatic multi-labeling scheme is executed over four public Android malware datasets, the different results and statistics obtained regarding their composition and representativeness being discussed. We share in a GitHub repository the multi-labeling tool developed, for public usage. (C) 2022 The Author(s). Published by Elsevier Ltd.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] A HIERARCHICAL ALGORITHM FOR IMAGE MULTI-LABELING
    Hu, Jiwei
    Lam, Kin Man
    Qiu, Guoping
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 2349 - 2352
  • [2] Epitomized Priors for Multi-labeling Problems
    Warrell, Jonathan
    Prince, Simon J. D.
    Moore, Alastair P.
    [J]. CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 2804 - 2811
  • [3] Periodic Multi-labeling of Public Transit Lines
    Polishchuk, Valentin
    Vihavainen, Arto
    [J]. GEOGRAPHIC INFORMATION SCIENCE, 2010, 6292 : 175 - 188
  • [4] Multi-labeling with topic models for searching security information
    Yuki Osada
    Ryusei Nagasawa
    Yoshiaki Shiraishi
    Makoto Takita
    Keisuke Furumoto
    Takeshi Takahashi
    Masami Mohri
    Masakatu Morii
    [J]. Annals of Telecommunications, 2022, 77 : 777 - 788
  • [5] Multi-labeling with topic models for searching security information
    Osada, Yuki
    Nagasawa, Ryusei
    Shiraishi, Yoshiaki
    Takita, Makoto
    Furumoto, Keisuke
    Takahashi, Takeshi
    Mohri, Masami
    Morii, Masakatu
    [J]. ANNALS OF TELECOMMUNICATIONS, 2022, 77 (11-12) : 777 - 788
  • [6] Development of a Multi-Behavioral mHealth App for Women Smokers
    Armin, Julie
    Johnson, Thienne
    Hingle, Melanie
    Giacobbi, Peter, Jr.
    Gordon, Judith S.
    [J]. JOURNAL OF HEALTH COMMUNICATION, 2017, 22 (02) : 153 - 162
  • [7] Multi-behavioral Multi-Robot Systems driven by Motivation Dynamics
    Baxevani, Kleio
    Tanner, Herbert G.
    [J]. 2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 1467 - 1472
  • [8] MULTI-BEHAVIORAL DETERMINANTS OF WEIGHT LOSS IN MEN AND WOMEN
    Ramirez, Ernesto
    Norman, Gregory J.
    Merchant, Gina
    Sallis, James F.
    Calfas, Karen J.
    Patrick, Kevin
    [J]. ANNALS OF BEHAVIORAL MEDICINE, 2011, 41 : S101 - S101
  • [9] Arabic text classification: the need for multi-labeling systems
    El Rifai, Hozayfa
    Al Qadi, Leen
    Elnagar, Ashraf
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (02): : 1135 - 1159
  • [10] Arabic text classification: the need for multi-labeling systems
    Hozayfa El Rifai
    Leen Al Qadi
    Ashraf Elnagar
    [J]. Neural Computing and Applications, 2022, 34 : 1135 - 1159