Is It Overkill? Analyzing Feature-Space Concept Drift in Malware Detectors

Authors
Chen, Zhi [1]
Zhang, Zhenning [1]
Kan, Zeliang [2, 3]
Yang, Limin [1]
Cortellazzi, Jacopo [2, 3]
Pendlebury, Feargus [3]
Pierazzi, Fabio [2]
Cavallaro, Lorenzo [3]
Wang, Gang [1]
Affiliations
[1] Univ Illinois, Urbana, IL 61081 USA
[2] Kings Coll London, London, England
[3] UCL, London, England
DOI
10.1109/SPW59333.2023.00007
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Concept drift is a major challenge faced by machine-learning-based malware detectors when deployed in practice. While existing works have investigated methods to detect concept drift, the main causes behind the drift are not yet well understood. In this paper, we design experiments to empirically analyze the impact of feature-space drift (new features introduced by new samples) and compare it with data-space drift (data distribution shift over existing features). Surprisingly, we find that data-space drift is the dominant contributor to model degradation over time, while feature-space drift has little to no impact. This is consistently observed for both Android and PE malware detectors, with different feature types and feature engineering methods, across different settings. We further validate this observation with recent online-learning-based malware detectors that incrementally update the feature space. Our results indicate that concept drift may be handled without frequent feature updating, and we further discuss open questions for future research.
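The two kinds of drift named in the abstract can be told apart with a toy example. The sketch below is illustrative only and is not the paper's methodology: the function names, the two drift scores, and the sample feature strings are all assumptions made for this example. Each sample is modeled as a set of binary features.

```python
from collections import Counter


def feature_space_drift(train_samples, test_samples):
    """Fraction of distinct test-time features that never appear in training.

    Hypothetical score: it only captures *new* features introduced by new samples.
    """
    train_vocab = set().union(*train_samples)
    test_vocab = set().union(*test_samples)
    if not test_vocab:
        return 0.0
    return len(test_vocab - train_vocab) / len(test_vocab)


def data_space_drift(train_samples, test_samples):
    """Mean absolute change in per-feature frequency over *existing* features.

    Hypothetical score: features outside the training vocabulary are ignored,
    so only distribution shift over known features is measured.
    """
    train_vocab = set().union(*train_samples)
    if not train_vocab or not test_samples:
        return 0.0
    train_freq = Counter(f for s in train_samples for f in s)
    test_freq = Counter(f for s in test_samples for f in s if f in train_vocab)
    n_train, n_test = len(train_samples), len(test_samples)
    total = sum(
        abs(train_freq[f] / n_train - test_freq[f] / n_test) for f in train_vocab
    )
    return total / len(train_vocab)


# Made-up feature sets, loosely styled after Android permission/API features.
train = [{"perm.SMS", "api.getDeviceId"}, {"perm.SMS"}]
test = [{"api.getDeviceId", "api.newCall"}, {"api.getDeviceId"}]

fs = feature_space_drift(train, test)  # one of two test features is new
ds = data_space_drift(train, test)     # both known features change frequency
```

In this toy data, `api.newCall` contributes only to the feature-space score, while the frequency changes of `perm.SMS` and `api.getDeviceId` contribute only to the data-space score, which is the separation the abstract describes.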
Pages: 21-28
Page count: 8